A THERMOSTABLE LIGASE MEDIATED DNA AMPLIFICATION 
SYSTEM FOR THE DETECTION OF GENETIC DISEASES 

More than 2,000 conditions have been identified as single-gene 
defects for which the risk of producing affected offspring 
can be mathematically predicted. Among these conditions in man 
are Huntington's chorea, cystic fibrosis, alpha1-antitrypsin 
deficiency, muscular dystrophy, Hunter's syndrome, Lesch-Nyhan 
syndrome, Down's syndrome, Tay-Sachs disease, hemophilias, 
phenylketonuria, thalassemias, and sickle-cell anemia. 

Three important techniques have been developed recently 
for directly detecting these single nucleic acid base pair changes, 
deletions, insertions, translocations or other mutations. 
However, two of these techniques cannot be easily automated. In 
the first such technique, the presence or absence of the mutation 
in a patient's clinical sample is detected by analysis of a 
restriction digest of the patient's DNA using Southern blotting 
[see Journal of Molecular Biology 98:503 (1975)]. However, the 
Southern blotting technique cannot be used for genetic diseases 
where the mutation does not alter a restriction site as, for 
example, in alpha1-antitrypsin deficiency. The second technique 
uses DNA probes and involves the synthesis of an 
oligonucleotide of about 19 bases that is complementary to 
the normal DNA sequence around the mutation site. The probe is 
labelled and used to distinguish normal from mutant genes by 
raising the stringency of hybridization to a level at which the probe 



WO 91/17239    PCT/US91/02968 



will hybridize stably to the normal gene, but not to the mutant 
gene with which it has a single base pair mismatch [see Proc. 
Natl. Acad. Sci. USA 80:278 (1983)]. The original method has been 
modified by immobilizing the oligonucleotide and probing with a 
labelled PCR amplified sample. In this modification, the sample 
is allowed to hybridize to an immobilized oligonucleotide and 
then washed off by raising the stringency of hybridization as 
described above [see Proc. Natl. Acad. Sci. USA 86:6230 (1989)]. 
Other methods have been developed which use fluorescent PCR 
primers to specifically amplify only one mutation or allele [see 
Proc. Natl. Acad. Sci. USA 86:9178 (1989)]. This method requires 
the separation of products from primers by spin columns or gel 
electrophoresis and hence is not amenable to large scale 
automation. The third technique utilizes the presence of both 
diagnostic and contiguous probes under conditions wherein the 
diagnostic probe remains substantially covalently bound to the 
contiguous probe only in the case wherein the sample nucleic acid 
contains the exact target sequence. In addition, the diagnostic 
oligonucleotide probe may contain a "hook" (for example, a 
biotinylated oligonucleotide) which is captured (for example, by 
streptavidin) as a means of increasing the efficiency of the 
technique, and the contiguous probe may contain a detectable 
moiety or label [see Science 241:1077 (1988) and U.S. Patent 
4,883,750]. 

Although it is not always necessary, the detection of single 
base pair mutations in DNA is usually preceded by techniques to 
increase or amplify the amount of DNA sample material. A 
number of techniques exist to perform nucleic acid amplification, 
among which are: (1) polymerase chain reaction, which can amplify 
DNA a million fold from a single copy in a matter of hours using 
Taq polymerase and running 20 to 30 reaction cycles on a 
temperature cycling instrument [see Science 239:487 (1988), and 
United States Patents 4,683,195, 4,683,202, and 4,800,159]; (2) 
self-sustained sequence replication or 3SR, which can amplify DNA or 
RNA 10 million fold from a single copy in less than an hour using 
reverse transcriptase, T7 RNA polymerase, and RNase H under 
isothermal conditions at 37°C [see Proc. Natl. Acad. Sci. USA 
87:1874 (1990)]; and (3) Q Beta Replicase, which can replicate a few 
thousand RNA molecules containing a special 300bp recognition 
sequence a billion fold in 30 minutes. Additional techniques are 
available, and one, the ligase chain reaction, is discussed in the 
following description of the cloned thermophilic ligase according 
to the present invention. 
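
As a rough check on the polymerase chain reaction figures quoted above, the relation between cycle count and fold amplification can be sketched as follows. This is a minimal illustration that assumes ideal doubling of the target in every cycle, which real reactions do not fully achieve; the function names are illustrative and do not come from the source.

```python
import math


def ideal_fold_amplification(cycles: int) -> int:
    """Fold increase after `cycles` rounds, assuming perfect doubling per cycle."""
    return 2 ** cycles


def cycles_for_fold(fold: float) -> int:
    """Smallest number of ideal doubling cycles that reaches `fold`."""
    return math.ceil(math.log2(fold))


print(ideal_fold_amplification(20))  # 1048576, i.e. about a million fold
print(cycles_for_fold(1e6))          # 20
```

Under this idealized assumption, 20 cycles already give roughly a million fold, which is consistent with the 20 to 30 cycles stated once less-than-perfect efficiency is allowed for.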

In addition to various genetic diseases which may be 
diagnosed utilizing the present invention, various infectious 
diseases can be diagnosed by the presence in a clinical sample of 
a specific DNA sequence characteristic of the causative 
microorganism. These include bacteria, viruses, and parasites. In 
such procedures, a relatively small number of pathogenic 
organisms may be present in a clinical sample from an infected 
patient and the DNA extracted from these organisms may 
constitute only a very small fraction of the total DNA in the 
sample. However, specific amplification of suspected pathogen-specific 
sequences prior to immobilization and detection by 
hybridization of the DNA samples should greatly improve the 
sensitivity and specificity of traditional procedures. In addition, 
amplification is particularly useful if such an analysis is to be 
done on a small sample using nonradioactive detection techniques 
which may be inherently insensitive, or where radioactive 
techniques are employed, but where rapid detection is desirable. 

Although techniques such as these are available, the search 
for other techniques for determining single base pair mutations 
continues. The present invention, that is, DNA amplification 
and/or detection by a ligase detection reaction (LDR) or ligase 
chain reaction (LCR) utilizing the thermophilic DNA ligase from 
Thermus aquaticus to detect a target DNA sequence, is part of that 
continuing effort. 

Although other techniques utilizing E. coli or T4 DNA ligase 
for DNA amplification have been attempted, these have been found 
to be unacceptable because of high background "noise" levels 
(after as few as 10 cycles), a condition which does not exist in 
the ligase chain reaction according to the present invention. 

DNA amplification and/or detection has also been attempted 
utilizing specific ligases. For example, a ligase amplification 
reaction has been reported [see Gene 76:245 (1989)] that can 
amplify DNA starting with 500,000 copies in 95 hours, using 75 
cycles and replenishing the T4 DNA ligase used after each cycle. 
However, this reported technique is slow and requires the 
addition of fresh T4 ligase at each step, both of which 
requirements make this reported technique unacceptable for 
automation. The ligase chain reaction according to the present 
invention allows for amplification of DNA from 200 copies in 3 
hours using 30 cycles and does not require the addition of ligase 
following each cycle. 

Throughout the following description of the present 
invention, terminology specific to the technology field will be 
used. In order to avoid any misunderstandings as to what is being 
referenced, and to provide the reader with a clear understanding 
of what is being described, the following definitions will be used: 

"Amplification" refers to the increase in the number of 
copies of a particular nucleic acid fragment resulting either from 
an enzymatic chain reaction (such as a polymerase chain reaction, 
a ligase chain reaction, or a self-sustained sequence replication), 
or from the replication of the vector into which it has been 
cloned. 

"Blunt end ligation" refers to the covalent linkage of two 
ends of DNA that are completely flush, i.e. have no cohesive end 
overhangs. 

"Cell", "cell line", and "cell culture" may be used 
interchangeably and all such designations include progeny. Thus, 
the words "transformants" or "transformed cells" include the 
primary subject cell and cultures derived therefrom without 
regard for the number of transfers. It is also understood that all 
progeny may not be precisely identical in DNA content due to 
deliberate or inadvertent mutations. However, all mutant progeny 
having the same functionality as screened for in the originally 
transformed cell are included. 

"Clone" refers to a group of genetically identical molecules, 
cells or organisms asexually descended from a common ancestor. 
"Cloning" is the process of propagating such identical molecules, 
cells or organisms. Recombinant DNA techniques make it possible 
to clone individual genes; this is referred to as "molecular 
cloning". 

"Covalently attaching" refers to forming a covalent 
chemical bond between two substances. 

"Cycle" refers to a single melting and cooling of DNA. For 
example, at very high temperatures such as 94°C, virtually all 
double stranded DNA (independent of length) unwinds and melts. 
If one lowers the temperature (to 45-65°C) in the presence of 
complementary oligonucleotides, they can hybridize to the correct 
sequences of the unwound melted DNA. DNA that has been melted 
and cooled in the presence of complementary oligonucleotides is 
now a substrate for the DNA ligase reaction. See "Tm". 

"Diagnostic portion" refers to that portion of the target 
sequence which contains the nucleotide change, the presence or 
absence of which is to be detected. "Contiguous portion" refers to 
a sequence of DNA which is a continuation of the nucleotide 
sequence of that portion of the sequence chosen as diagnostic. 
The continuation can be in either direction. 

It will be recognized, based on the following description, 
that the precise position of the selected oligonucleotide 
containing the diagnostic portion is arbitrary, except that it must 
contain the nucleotide(s) which differentiate the presence or 
absence of the target sequence at one of its ends. Thus, the 
oligonucleotide containing the contiguous portion continues the 
sequence of this arbitrarily chosen oligonucleotide containing the 
diagnostic portion such that the diagnostic nucleotide(s) is at the 
junction of the two oligonucleotides. 
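
The junction rule described above can be sketched in code. The following is a minimal illustration only, assuming a single target strand written 5' to 3', a hypothetical helper named `split_at_junction`, a made-up example sequence, and an arbitrary portion length. The actual probes would be complementary to these portions.

```python
def split_at_junction(target: str, variant_index: int, length: int = 20):
    """Split a target strand into diagnostic and contiguous portions.

    The diagnostic portion ends on the discriminating base at
    `variant_index` (0-based), and the contiguous portion begins at the
    very next base, so the discriminating base sits at the junction.
    Hypothetical helper for illustration; not taken from the source.
    """
    start = variant_index - length + 1
    end = variant_index + 1 + length
    if start < 0 or end > len(target):
        raise ValueError("target too short for the requested length")
    diagnostic = target[start:variant_index + 1]
    contiguous = target[variant_index + 1:end]
    return diagnostic, contiguous


# Made-up 16-base sequence; the base at index 7 is treated as the variant.
diag, cont = split_at_junction("AAACCCGGGTTTACGT", variant_index=7, length=4)
print(diag, cont)  # CCGG GTTT
```

Placing the discriminating base at the end of the diagnostic portion, rather than in its interior, reflects the requirement that the nucleotide(s) differentiating the target lie at the junction of the two oligonucleotides.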

"Endonuclease" refers to an enzyme (e.g., restriction 
endonuclease, DNase I) that cuts DNA at sites within the 
molecule. 

"Expression system" refers to DNA sequences containing a 
desired coding sequence and control sequence in operable linkage 
in such a manner that hosts transformed with these sequences are 
capable of producing the encoded proteins. In order to effect 
transformation, the expression system may be included on a 
vector, or the transformed vector DNA may also be integrated into 
the host chromosome. 

"Gene" refers to a DNA sequence which encodes a 
recoverable bioactive polypeptide or precursor. The polypeptide 
can be encoded by a full-length gene sequence or any portion of 
the coding sequence so long as the enzymatic activity is retained. 

"Gene library" or "library" refers to a collection of 
randomly-cloned fragments that encompass substantially the 
entire genome of a given species. This is also referred to as a 
clone bank or shotgun collection. 

"Genome" refers to the entire DNA of an organism. 

"Hook" refers to a modification of a probe that enables the 
user to rapidly and conveniently isolate probes containing this 
modification by "catching" the hook. The interaction between 
hook and catching mechanism can be, for example, covalent 
bonding or ligand/receptor binding of sufficient affinity. Such 
hooks may include antigens which can be recovered by antibody, 
biotin which can be recovered by avidin or streptavidin, specific 
DNA sequences which can be recovered by complementary nucleic 
acid, or DNA binding proteins (repressors), and specific reactive 
chemical functionalities which can be recovered by other 
appropriate reactive groups. 

"Hybridization" and "binding" in the context of probes and 
denatured melted DNA are used interchangeably. Probes which are 
hybridized or bound to denatured DNA are base paired or 
"aggregated" to complementary sequences in the polynucleotide. 
Whether or not a particular probe remains base paired or 
aggregated with the polynucleotide depends on the degree of 
complementarity, the length of the probe, and the stringency of 
the binding conditions. The higher the stringency, the higher must 
be the degree of complementarity, and/or the longer the probe. 

"Klenow fragment" refers to a 76,000 dalton polypeptide 
obtained by partial proteolytic digestion of DNA polymerase I. 
This enzyme possesses the 5'→3' polymerase and 3'→5' 
exonuclease activities, but not the 5'→3' exonuclease activity 
of DNA polymerase I. 

"Label" refers to a modification to the probe nucleic acid 
which enables the user to identify the labelled nucleic acid in the 
presence of unlabelled nucleic acid. Most commonly, this is the 
replacement of one or more atoms with radioactive isotopes. 
However, other labels may be substituted for the isotopes as, for 
example, covalently attached chromophores, fluorescent moieties, 
enzymes, antigens, groups with specific reactivity, 
chemiluminescent moieties, and electrochemically detectable 
moieties. 

"Ligase" refers to an enzyme which catalyses the formation 
of a phosphodiester bond at the site of a single-stranded break in 
duplex DNA. The ligase enzyme also catalyses the covalent 
linkage of duplex DNA; blunt end to blunt end, or one cohesive end 
to another complementary cohesive end. 

"Ligase Chain Reaction (LCR)" refers to the amplification of 
an oligonucleotide ligation product. For example, if 
oligonucleotides are designed such that the DNA products of one 
cycle can become the DNA substrates of the next cycle, repeating 
such cycles will cause an exponential amplification of the DNA (a 
"chain reaction"). As a thermophilic ligase enzyme is capable of 
remaining active during many DNA melting and cooling cycles, 
this allows a DNA amplification to occur rapidly and 
automatically in a single reaction vessel subject to many thermal 
cycles in which the oligonucleotide ligation product is amplified. 

"Ligase detection reaction (LDR)" refers to the use of two 
adjacent oligonucleotides for the detection of specific sequences 
with the aid of a thermophilic ligase with linear product 
amplification. 
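
The difference between the linear product amplification of LDR and the exponential amplification of LCR can be illustrated with idealized arithmetic. The sketch below assumes that every available template directs exactly one ligation per cycle (full efficiency, no background), which real reactions will not achieve; the function names are illustrative and not part of the source.

```python
def ldr_products(targets: int, cycles: int) -> int:
    """Linear case: only the original targets ever act as templates."""
    return targets * cycles


def lcr_products(targets: int, cycles: int) -> int:
    """Exponential case: each ligated product is a template in later cycles."""
    templates = targets
    products = 0
    for _ in range(cycles):
        new = templates      # one ligation per available template
        products += new
        templates += new     # new products join the template pool
    return products


print(ldr_products(200, 30))  # 6000
print(lcr_products(200, 30))  # 214748364600
```

The contrast comes entirely from whether ligation products are allowed to serve as templates in the next cycle, which is the defining feature of the chain reaction.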

"Ligase DNA sequence" refers to the DNA sequence in 
Thermus aquaticus HB8 for the thermophilic ligase of the present 
invention which comprises, at the amino terminus of the ligase 
protein, the following nucleic acid sequence: 

TCGGAAIAGG GGATGOGCOC CTAGTCCAAG GGAAAGTATA GCCCAAGGTA 
CACTAGGGCC 

ATG ACC CTG GAA GAG GCG AGG AAG CGG GTA AAC GAG TTA CGG GAC 
CTC ATC CGC TAC CAC AAC TAC CGC TAC TAC GTC CTG GCG GAC CCG 
GAG ATC TCC GAC GCC GAG TAC GAC CGG CTT CTT AGG GAG CTC AAG 
GAG CTT GAG GAG CGC TTC CCC GAG CTC AAA AGC CCG GAC TCC CCC 
ACC CTT CAG GTG GGG GCG AGG CCT TTG GAG GCC ACC TTC CGC CCC 
GTC CGC CAC CCC ACC CGC ATG TAC TCC TTG GAC AAC GCC TTT AAC 
CTT GAC GAG CTC AAG GCC TTT GAG GAG CGG ATA GAA CGG GCC CTG 
GGG CGG AAG GGC CCC TTC GCC TAC ACC GTG GAG CAC AAG GTG GAC 
GGG CTT TCC GTG AAC CTC TAC TAC GAG GAG GGG GTC CTG GTC TAC 
GGG GCC ACC GCC GGG GAC GGG GAG GTG GGG GAG GAG GTC ACC CAG 
AAC CTC CTC ACC ATC CCC ACC ATC CCG AGG AGG CTC AAG GGG GTG 
CCG GAG CGC CTC GAG GTC CGG GGG GAG GTC TAC ATG CCC ATA GAG 
GCC TTC CTC CGG CTC AAC GAG GAG CTG GAG GAG CGG GGG GAG AGG 
ATC TTC AAA AAC CCT AGG AAT GGG GCG GCG GGT TCC TTA AGG CAA 
AAA GAC CCC CGC ATC ACC GCC AAG CGG GGC CTC AGG GCC ACC TTC 
TAC GCC TTA GGG CTT GGG CTG GAG GAG GTG GAG AGG GAA GGG GTG 
GGG ACC CAG TTT GCC CTC CTC CAC TGG CTC AAG GAA AAA GGC TTC 
CGC GTG GAG CAC GGC TAC GCC CGG GCC GTG GGG GCG GAA GGG GTG 
GAG GGG GTC TAC CAG GAC TGG CTC AAG AAG CGG CGG GCG CTT CCC 
TTT GAG GGG GAC GGG GTG GTG GTG AAG CTG GAC GAG CTT GCC CTT 
TGG CGG GAG CTC GGC TAC ACC GCC CGC GCC CCC CGG TTC GCC ATC 






GCC TAG AAG TTC CCC GCC GAG GAG AAG GAG ACC CGG CTT TTG GAC 
GTG GTC TTC CAG GTG GGG GGC ACC GGG CGG GTG ACC CCC GTG GGG 
ATC CTC GAG CCC GTC TTC CTA GAG GGC AGC GAG GTC TCC CGG GTC 
ACC CTG CAC AAC GAG AGC TAG ATA GAG GAG TTG GAC ATC CGC ATC 
GGG GAC TGG GTT TTG GTG CAC AAG GCG GGC GGG GTC ATC CCC GAG 
GTC CTC CGG GTC CTC AAG GAG AGG CGC ACG GGG GAG GAA AGG CCC 
ATT CGC TGG CCC GAG ACC TGC CCC GAG TGC GGC CAC CGC CTC CTC 
AAG GAG GGG AAG GTC CAC CGC TGC CCC AAC CCC TTG TGC CCC GCC 
AAG CGC TTT GAG GCC ATC CGC CAC TTC GCC TCC CGC AAG GCC ATG 
GAC ATC CAG GGC CTG GGG GAA AAG CTC ATT GAG AGG CTT TTG GAA 
AAG GGG CTG GTC AAG GAC GTG GCC GAC CTC TAG CGC TTG AGA AAG 
GAA GAC CTG GTG GGC CTG GAG CGC ATG GGG GAG AAG AGC GCC CAA 
AAC CTC CTC CGC GAG ATA GAG GAG AGC AAG AAA AGA GGC CTG GAG 
CGC CTC CTC TAG GCC TTG GGG CTT CCC GGG GTG GGG GAG GTC TTG 
GCC CGG AAC CTG GCG GCC CGC TTC GGG AAC ATG GAC CGC CTC CTC 
GAG GCC AGC CTG GAG GAG CTC CTG GAG GTG GAG GAG GTG GGG GAG 
CTC ACG GCG AGG GCC ATC CTG GAG ACC TTG AAG GAC CCC GCC TTC 
CGC GAC CTG GTA CGG AGG CTC AAG GAG GCG GGG GTG GAG ATG GAG 
GCC AAG GAG AAG GGC GGG GAG GCC CTT AAA GGG CTC ACC TCC GTG 
ATC ACC GGG GAG CTT TCC CGC CCC CGG GAA GAG GTG AAG GCC CTC 
CTA AGG CGC CTC GGG GCC AAG GTG ACG GAC TCC GTG AGC CGG AAG 
ACG AGC TAG CTC GTG GTG GGG GAG AAC CCG GGG GAG AAC CCG GGG 
AGC AAG CTG GAG AAG GCC AGG GCC CTC GGG GTC CCC ACC CTC ACG 
GAG GAG GAG CTC TAG CGG CTC CTG GAG GCG CGG ACG GGG AAG AAG 
GCG GAG GAG CTC GTC TAA AQGCTTCC. 

The corresponding amino acids are: 

Met Thr Leu Glu Glu Ala Arg Lys Arg Val Asn Glu Leu Arg Asp 
Leu Ile Arg Tyr His Asn Tyr Arg Tyr Tyr Val Leu Ala Asp Pro 
Glu Ile Ser Asp Ala Glu Tyr Asp Arg Leu Leu Arg Glu Leu Lys 
Glu Leu Glu Glu Arg Phe Pro Glu Leu Lys Ser Pro Asp Ser Pro 
Thr Leu Glu Val Gly Ala Arg Pro Leu Glu Ala Thr Phe Arg Pro 
Val Arg His Pro Thr Arg Met Tyr Ser Leu Asp Asn Ala Phe Asn 
Leu Asp Glu Leu Lys Ala Phe Glu Glu Arg Ile Glu Arg Ala Leu 
Gly Arg Lys Gly Pro Phe Ala Tyr Thr Val Glu His Lys Val Asp 
Gly Leu Ser Val Asn Leu Tyr Tyr Glu Glu Gly Val Leu Val Tyr 
Gly Ala Thr Arg Gly Glu Gly Glu Val Gly Glu Glu Val Thr Gln 
Asn Leu Leu Thr Ile Pro Thr Ile Pro Arg Arg Leu Lys Gly Val 
Pro Glu Arg Leu Glu Val Arg Gly Glu Val Tyr Met Pro Ile Glu 
Ala Phe Leu Arg Leu Asn Glu Glu Leu Glu Glu Arg Gly Glu Arg 
Ile Phe Lys Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu Arg Gln 
Lys Asp Pro Arg Ile Thr Ala Lys Arg Gly Leu Arg Ala Thr Phe 
Tyr Ala Leu Gly Leu Gly Leu Glu Glu Val Glu Arg Glu Gly Val 
Ala Thr Gln Phe Ala Leu Leu His Trp Leu Lys Glu Lys Gly Phe 
Pro Val Glu His Gly Tyr Ala Arg Ala Val Gly Ala Glu Gly Val 
Glu Ala Val Tyr Gln Asp Trp Leu Lys Lys Arg Arg Ala Leu Pro 
Phe Glu Ala Asp Gly Val Val Val Lys Leu Asp Glu Leu Ala Leu 
Trp Arg Glu Leu Gly Tyr Thr Ala Arg Ala Pro Arg Phe Ala Ile 
Ala Tyr Lys Phe Pro Ala Glu Glu Lys Glu Thr Arg Leu Leu Asp 
Val Val Phe Gln Val Gly Arg Thr Gly Arg Val Thr Pro Val Gly 
Ile Leu Glu Pro Val Phe Leu Glu Gly Ser Glu Val Ser Arg Val 
Thr Leu His Asn Glu Ser Tyr Ile Glu Glu Leu Asp Ile Arg Ile 
Gly Asp Trp Val Leu Val His Lys Ala Gly Gly Val Ile Pro Glu 
Val Leu Arg Val Leu Lys Glu Arg Arg Thr Gly Glu Glu Arg Pro 
Ile Arg Trp Pro Glu Thr Cys Pro Glu Cys Gly His Arg Leu Leu 
Lys Glu Gly Lys Val His Arg Cys Pro Asn Pro Leu Cys Pro Ala 
Lys Arg Phe Glu Ala Ile Arg His Phe Ala Ser Arg Lys Ala Met 
Asp Ile Gln Gly Leu Gly Glu Lys Leu Ile Glu Arg Leu Leu Glu 
Lys Gly Leu Val Lys Asp Val Ala Asp Leu Tyr Arg Leu Arg Lys 
Glu Asp Leu Val Gly Leu Glu Arg Met Gly Glu Lys Ser Ala Gln 
Asn Leu Leu Arg Gln Ile Glu Glu Ser Lys Lys Arg Gly Leu Glu 
Arg Leu Leu Tyr Ala Leu Gly Leu Pro Gly Val Gly Glu Val Leu 
Ala Arg Asn Leu Ala Ala Arg Phe Gly Asn Met Asp Arg Leu Leu 
Glu Ala Ser Leu Glu Glu Leu Leu Glu Val Glu Glu Val Gly Glu 
Leu Thr Ala Arg Ala Ile Leu Glu Thr Leu Lys Asp Pro Ala Phe 
Arg Asp Leu Val Arg Arg Leu Lys Glu Ala Gly Val Glu Met Glu 
Ala Lys Glu Lys Gly Gly Glu Ala Leu Lys Gly Leu Thr Phe Val 
Ile Thr Gly Glu Leu Ser Arg Pro Arg Glu Glu Val Lys Ala Leu 
Leu Arg Arg Leu Gly Ala Lys Val Thr Asp Ser Val Ser Arg Lys 
Thr Ser Tyr Leu Val Val Gly Glu Asn Pro Gly Ser Lys Leu Glu 
Lys Ala Arg Ala Leu Gly Val Pro Thr Leu Thr Glu Glu Glu Leu 
Tyr Arg Leu Leu Glu Ala Arg Thr Gly Lys Lys Ala Glu Glu Leu 
Val 
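
The correspondence between the nucleic acid listing and the amino acid listing can be spot-checked with the standard genetic code. The following is a minimal sketch, not part of the source: it builds the standard codon table in one-letter amino acid codes and translates the first five codons of the listing. The names `CODON_TABLE` and `translate` are illustrative, and a codon containing a character outside A, C, G and T raises a `KeyError`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA coding strand, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons of the listing: Met Thr Leu Glu Glu.
print(translate("ATG ACC CTG GAA GAG"))  # MTLEE
```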

"Ligating" refers to covalently attaching polynucleotide 
sequences together to form a single sequence. This is typically 
performed by treatment with a ligase which catalyzes the 
formation of a phosphodiester bond between the 5' end of one 
sequence and the 3' end of the other. However, in the context of 
the invention, the term "ligating" is also intended to encompass 
other methods of covalently attaching such sequences, e.g., by 
chemical means. The terms "covalently attaching" and "ligating" 
may be used interchangeably. 

"Nick closing activity" refers to covalent linkage of 
adjacent strands of DNA. It may be used to assay for ligase 
activity by virtue of converting open circular DNA (OCDNA) to 
covalently closed circular DNA (CCCDNA) and determining the 
speed at which the specimen DNA migrates on an ethidium 
bromide stained agarose gel (OCDNA migrates slower than 
CCCDNA). 

"Oligonucleotide" refers to a molecule comprised of two or 
more deoxyribonucleotides or ribonucleotides, preferably more 
than three. Its exact size will depend on the ultimate function or 
use of the oligonucleotide. The oligonucleotide may be derived 
synthetically or by cloning. 

"Operably linked" refers to juxtaposition such that the 
normal function of the components can be performed. Thus, a 
coding sequence "operably linked" to control sequences refers to a 
configuration wherein the coding sequences can be expressed 
under the control of the control sequences. 

"Overproducer strain" refers to a strain of bacteria or other 
host cell that may be induced to overproduce a particular enzyme 
or chemical substance. 

"Polymerase" refers to enzymes which catalyze the 
assembly of deoxyribonucleotides into DNA. 

"Polymerase chain reaction (PCR)" refers to a patented 
process (described in United States Patents 4,683,202 and 
4,683,195) for the exponential amplification of a specific DNA 
fragment by utilizing two oligonucleotide primers that hybridize 
to opposite strands and flank the region of interest in a target 
DNA. The process consists of a repetitive series of cycles 
involving template denaturation, primer annealing, and the 
extension of the annealed primers by Taq DNA polymerase. 

"Probe" refers to an oligonucleotide designed to be 
sufficiently complementary to a sequence in a denatured nucleic 
acid to be probed (in relation to its length) to be bound under 
selected stringency conditions. "Contiguous probe" describes a 
probe which is complementary to the contiguous portion. 
"Diagnostic probe" describes a probe which is complementary to 
the diagnostic portion. "Target probe" describes a probe which is 
complementary to the target sequence and is made by covalently 
attaching (ligating) the diagnostic probe and the contiguous probe. 

"Reporter group" refers to a group that signifies the 
presence of a particular moiety (see "label"). 

"Restriction endonucleases" refers to those enzymes which 
cut DNA by recognizing specific sequences internal to the 
molecule and subsequently cutting the DNA in both strands at 
sites either within or outside of the recognition sequence. 

"Sticky end ligation" refers to the covalent linkage of two 
ends of DNA that contain complementary 5' or 3' single strand 
overhangs which are usually, but not limited to, one to five 
nucleotides in length. 

"Stringency" refers to the combination of conditions to 
which nucleic acids are subject that cause the double stranded 
DNA to dissociate into component single strands; among these are 
pH extremes, high temperature, and salt concentration. "High 
stringency" refers to the conditions, specifically hybridization 
and washing, which are sufficient to allow for the detection of 
unique sequences using an oligonucleotide probe or closely related 
sequence under standard Southern hybridization protocols [as 
described in J. Mol. Biol. 98:503 (1975)]. 

"Tm" refers to the temperature at which two complementary 
strands of DNA unwind and separate. This is a function of the 
single stranded DNA length and its base composition; for small 
fragments, an approximate value of Tm in °C is equal to 4(G+C) + 
2(A+T). For example, an oligonucleotide which has 5G, 7C, 5A, and 
4T bases has a Tm of 4(5+7) + 2(5+4) or 66°C. 
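
The approximation stated above can be written as a short function. This is a minimal sketch of the 4(G+C) + 2(A+T) rule exactly as given in the definition, intended only for small fragments; the function name and the example oligonucleotide (built simply to have the stated base counts) are illustrative.

```python
def approx_tm_celsius(oligo: str) -> int:
    """Approximate Tm in °C for a short oligonucleotide: 4(G+C) + 2(A+T)."""
    seq = oligo.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return 4 * gc + 2 * at


# An oligonucleotide with 5 G, 7 C, 5 A and 4 T bases, as in the worked example.
example = "G" * 5 + "C" * 7 + "A" * 5 + "T" * 4
print(approx_tm_celsius(example))  # 66
```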

"Target sequence" refers to a nucleic acid sequence, the 
presence or absence of which is desired to be detected. In the 
context of a preferred application of the method according to the 
present invention, it is a sequence which forms part of a coding 
region in a gene associated with a genetic disease, such as 
sickle-cell anemia. In many such diseases, the presence of the 
genetic aberration is characterized by small changes in the coding 
sequence; most frequently, normal individuals have sequences 
which differ by one nucleotide from the corresponding sequences 
present in individuals with the genetic "deficiency." In the 
method according to the present invention, either the normal or 
altered sequence can be used as the target sequence. 

"Thermophilic enzyme" refers to an enzyme which functions 
at high temperatures of 50 to 90°C; some may survive brief 
exposure to temperatures of 94 to 100°C at which normal 
enzymes denature and thus become inactive. 

"Thermostable ligase" refers to an enzyme which is stable 
to heat, is heat resistant, and catalyzes (facilitates) ligation, at 
high temperatures of 50 to 90°C, of adjacent oligonucleotides in 
the proper manner to form a product which is complementary to 
the target nucleic acid strand. Generally, the enzyme activates 
the 5' end of one oligonucleotide and links this to the 3' end of 
an adjacent DNA molecule. There may, however, be thermostable 
enzymes which use other mechanisms to covalently attach 
adjacent oligonucleotides. Thermostable ligase can, under the 
proper conditions, covalently link a number of different nucleic 
acid substrates at high temperatures of 50 to 90°C, such as 
closing "nicks" in DNA, and sticky end and blunt end ligations. 

The thermostable enzyme according to the present invention 
must satisfy a single criterion to be effective for the 
amplification reaction, i.e., the enzyme must not become 
irreversibly denatured (inactivated) when subjected to the 
elevated temperatures for the time necessary to effect 
denaturation of double-stranded nucleic acids. By "irreversible 
denaturation" as used in this connection, is meant a process 
bringing about a permanent and complete loss of enzymatic 
activity. The heating conditions necessary for denaturation will 
depend, e.g., on the buffer salt concentration and the length and 
nucleotide composition of the nucleic acids being denatured, but 
typically range from about 85°C, for shorter oligonucleotides, to 
about 105°C for a time depending mainly on the temperature and 
the nucleic acid length, typically from about 0.25 minutes for 
shorter oligonucleotides, to 4.0 minutes for longer pieces of DNA. 
Higher temperatures may be tolerated as the buffer salt 
concentration and/or GC composition of the nucleic acid is 
increased. Preferably, the enzyme will not become irreversibly 
denatured at about 90 to 100°C. The thermostable enzyme 
according to the present invention has an optimum temperature at 
which it functions that is greater than about 45°C, probably 
between 50 and 90°C, and optimally between 60 and 80°C. 

A more thorough and complete understanding of the cloning 
of the thermophilic ligase sequence and the use of this enzyme in 
the thermophilic ligase mediated DNA amplification procedure for 
the detection of single base pair sequence differences in genetic 
diseases can be obtained by reference to the following figures and 
examples which are presented by way of illustration only and are 
not intended, nor should they be considered, to limit the scope of 
the claimed invention. 

With specific reference to the figures, 

FIG. 1 is a depiction of plasmids pDZ1 and pDZ7; 

FIG. 2 is a flow chart of the Ligase Chain Reaction (LCR) 
according to the present invention; 

FIG. 3 is an autoradiogram demonstrating the specificity of 
T. aquaticus thermophilic ligase under both LDR and LCR 
amplification conditions according to the present invention; 

FIG. 4 is an autoradiogram demonstrating LCR amplification 
at different target concentrations; 

FIG. 5 is an autoradiogram demonstrating the detection of β 
globin alleles using human genomic DNA; 

FIG. 6 is an overview of an ELISA based oligonucleotide 
ligation assay according to the present invention; 

FIG. 7 is a photographic representation of SDS-10% 
polyacrylamide gel electrophoresis of the thermostable ligase, 
according to the present invention, at different stages of 
purification; 

FIG. 8 is a second photographic representation of SDS-10% 
polyacrylamide gel electrophoresis of the thermostable ligase, 
according to the present invention, at different stages of 
purification; and 

FIG. 9 is a depiction of three clones prepared in accordance 
with the present invention. 

In Fig. 7, lanes A and G represent marker proteins 
(molecular weights are given in kd); B represents whole cells 
after induction; C represents crude supernatant after sonication; 
D represents pooled DEAE flow-through after heat treatment; and 
E and F represent fractions 23 and 24 after phosphocellulose 
chromatography. In Fig. 8, lanes A and H represent marker 
proteins (molecular weights are given in kd); B represents whole 
cells after induction; C represents crude supernatant after 
sonication; D represents pooled DEAE flow-through after heat 
treatment; E represents fraction 23 after phosphocellulose 
chromatography; F represents fraction 23 incubated with nicked 
DNA in ligase buffer in the absence of NAD; and G represents 
fraction 23 incubated with NAD in ligase buffer in the absence of 
nicked DNA. In Fig. 8, the higher molecular weight ligase 
(approximately 81 kd) is the adenylated form, while the lower 
molecular weight ligase (approximately 78 kd) is non-adenylated. 

The plasmids depicted in Fig. 1 have been deposited with,
and accepted by, a collection agency under the Budapest Treaty
deposit rules. Plasmid pDZ1 has been incorporated within a host
bacterium (E. coli strain AK53), deposited with the American Type
Culture Collection, and granted the collection number ATCC No.
68307. Plasmid pDZ7 has been incorporated within a host
bacterium (E. coli strain AK53), deposited with the American Type
Culture Collection, and granted the collection number ATCC No.
68308.

While other methods may be used, in general, the production
of the thermophilic ligase according to the present invention will
be by recombinant means which typically involve the following:

First, a DNA is obtained which encodes the mature (as used
herein the term includes all muteins) enzyme or a fusion of the
thermophilic ligase to an additional sequence that does not
destroy its activity or to an additional sequence cleavable under
controlled conditions to give an active protein. If the sequence is
uninterrupted by introns, it is suitable for expression in any host.
However, the sequence should be in an excisable and recoverable
form. Using PCR technology, for example, most DNA sequences
coding for enzymes may be amplified and hence recovered in an
"excised" form.

The excised or recovered coding sequence is then placed in
operable linkage with suitable control sequences in a replicable
expression vector which is used to transform a suitable host. The
transformed host is then cultured under suitable conditions to
effect the production of the recombinant thermophilic ligase, and
the ligase isolated and purified by known means.

Each of the above procedures may be accomplished in a
variety of ways. For example, the desired coding sequences may
be obtained from genomic fragments and used directly in
appropriate hosts; the constructions for expression vectors
operable in a variety of hosts are made using appropriate
replicons and control sequences; and suitable restriction sites
may, if not normally available, be added to the ends of the coding
sequence so as to provide an excisable gene to insert into the
appropriate vector.

The control sequences, expression vectors, and
transformation methods are dependent on the type of host cell
used to express the gene. Generally, bacterial hosts are the most
efficient and convenient for the production of recombinant
proteins and therefore preferred for the expression of the
thermophilic ligase according to the present invention. However,
other hosts such as yeast, plant, and insect or mammalian cells
may also be used if convenient. For the purposes of the present
invention, one source of the host cell is considered to be
equivalent to any other available and suitable host cell source.

EXAMPLE I
(growth of T. aquaticus strain HB8 and isolation of DNA)
DNA was isolated from Thermus thermophilus strain HB8
(ATCC No. 27634). This strain has recently been reclassified as
Thermus aquaticus strain HB8 [see Arch. Microbiol. 117:189
(1978)].

Cells were grown overnight at 75°C in a water bath shaker
in TAB broth [see Nuc. Acids Res., pgs 6795-6804 (1981)] (which
contains per liter, 5 g Bacto™-tryptone, 3 g yeast extract, 2 g
NaCl, and 1 g dextrose) adjusted to pH 7.2 - 7.5 with NaOH, and
harvested by centrifugation to yield 3.1 g wet weight from 800
ml of media. Cells were resuspended in 15 ml of 50 mM Tris pH
8.0 buffer containing 50 mM EDTA and 15 mg egg white lysozyme.
The resuspended cells were lysed by the addition of 2 ml of 10%
(weight/volume) sodium dodecyl sulfate followed by incubation at
37°C for 15 minutes and two repeated cycles of freezing at -50°C
and thawing at 37°C. The aqueous solution was extracted
sequentially with equal volumes of aqueous phenol
(preequilibrated to pH 7.5 with sodium borate), followed by
phenol/chloroform, and finally chloroform.

Nucleic acids were precipitated by mixing with 2 volumes
of 95% ethanol, chilling to -50°C for 15 min., and pelleting by
centrifugation. After removal of the supernatant and drying the
pellet, nucleic acids were resuspended in 1 ml TE buffer (10 mM
Tris HCl, pH 8.0, containing 1 mM EDTA). RNA was digested by the
addition of 100 µg RNase A to each ml of suspension, and the
mixture incubated at 37°C for 1 hr. DNA was precipitated by
adding 1/10th vol. of 3 M sodium acetate and 3 vol. of 100%
ethanol, chilled to -50°C for 15 min., pelleted by centrifugation,
washed with 70% ethanol, and finally resuspended in TE buffer at
a final concentration of 2 mg/ml.

Although DNA utilized in the example given above was
isolated from Thermus aquaticus, the resultant thermophilic
ligase having the necessary properties according to the present
invention may have as its initial source DNA isolated from other
Thermus species or other thermophilic bacteria, phages, or
viruses.

DNA isolated from T. aquaticus strain HB8 cannot be
cleaved by the restriction endonucleases Taq I (whose recognition
sequence is TCGA) or EcoRI (whose recognition sequence is
GAATTC). The inability to cleave certain sequences is a
consequence of protective methylation [see H. O. Smith and S. V.
Kelly, DNA Methylation: Biochemistry and Biological Significance,
eds. Razin, Cedar and Riggs, p 39 - 71, Springer-Verlag Inc., New
York (1987)] at the N6 position of adenine residues. Previous
investigators [see J. Bact. 169:3243 (1987)] have shown that
there is a gene, termed mrr, which restricts adenine methylated
DNA of the form G-6MeANTC and CTGC-6MeAG. In the cloning of
the Taq I restriction endonuclease and methylase, several E. coli
strains were found to restrict the TCGA methylated DNA, an
effect originally (but incorrectly) attributed to the mrr gene [see
Gene 56:13 (1987) and Nuc. Acid Res. 15:9781 (1987)]. Recent
work conducted at the Cornell University Medical College has
shown the presence of an additional gene, besides mrr, which
encodes a protein that restricts TCGA methylated DNA. Briefly,
strains containing a Tn5 (KmR) transposon disrupting the mrr
gene [see J. Bact. 169:3243 (1987)] were used for transduction
[according to J. H. Miller in Experiments in Molecular Genetics,
Cold Spring Harbor Laboratory, pp 201-205 (1972)] of the KmR
marker into several strains of Escherichia coli, converting these
strains to an mrr- (defective mrr protein) genotype. None
of these transduced strains could tolerate the Taq methylase
gene, indicating there is a second gene responsible for the
restriction of TCGA methylated DNA. Thus, one of the first
necessary requirements (which prior to the present invention had
not been apparent) for the making of the present invention was
the selection of an E. coli strain which would not heavily restrict
TCGA methylated DNA.
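
As a rough illustration of the point made above, the following
sketch scans a sequence for the Taq I (TCGA) and EcoRI (GAATTC)
recognition sequences and reports which occurrences would remain
cleavable given a set of methylated adenine positions. The function
names, the example sequence, and the simplification that a single
methylated adenine within the site blocks cleavage are illustrative
assumptions only; only one strand is modelled.

```python
# Recognition sequences quoted in the text.
RECOGNITION_SITES = {"TaqI": "TCGA", "EcoRI": "GAATTC"}


def find_sites(sequence, site):
    """Return 0-based start positions of every occurrence of `site`."""
    sequence = sequence.upper()
    positions = []
    start = sequence.find(site)
    while start != -1:
        positions.append(start)
        start = sequence.find(site, start + 1)
    return positions


def cleavable_sites(sequence, site, methylated_adenines):
    """Keep only sites that contain no methylated adenine.

    Assumption: one N6-methyladenine anywhere inside the recognition
    sequence is enough to block cleavage at that site.
    """
    sequence = sequence.upper()
    result = []
    for start in find_sites(sequence, site):
        blocked = any(
            pos in methylated_adenines and sequence[pos] == "A"
            for pos in range(start, start + len(site))
        )
        if not blocked:
            result.append(start)
    return result


# Hypothetical sequence containing one TaqI site and one EcoRI site.
example = "GGTCGAACCGAATTCGG"
taq_all = find_sites(example, RECOGNITION_SITES["TaqI"])
eco_all = find_sites(example, RECOGNITION_SITES["EcoRI"])
# Methylating the adenine at index 5 (inside TCGA) protects that site.
taq_after_methylation = cleavable_sites(example, "TCGA", {5})
```

Under these assumptions an unmethylated site is reported as
cleavable, whereas the same site with a methylated adenine is not,
which mirrors the behaviour described for T. aquaticus DNA.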

In the present invention, a derivative of the RR1 strain of E.
coli which could tolerate the Taq methylase gene and which
contained a Tn10 (TcR) transposon was transduced to a ligts7
strain [N3098, see Wilson and Murray, J. Mol. Biol. (1979) and J.
Mol. Biol. 77:531 (1973)] to create E. coli strain AK76. This
strain has been deposited in the American Type Culture
Collection, and has been granted the collection number ATCC No.
55032. This strain contains a temperature sensitive ligase gene,
such that at 42°C the strain cannot grow. This strain can tolerate
the Taq methylase gene, and other methylated DNA, especially the
DNA isolated from T. aquaticus. Since it also has a temperature
sensitive ligase gene, it could be used as a host for the cloning of
a functional T. aquaticus ligase gene by selecting for growth at
42°C.

Cloning of the T. aquaticus ligase gene was based on a
positive selection scheme similar to that described by Wilson and
Murray. The approach was to construct libraries of T. aquaticus
DNA inserted into a suitable vector. These libraries were then
introduced, via transformation, into a ligts7 E. coli strain that did
not restrict methylated T. aquaticus DNA, such as strain AK76.
These cells were then grown at the nonpermissive temperature,
that is at 42°C. Any survivors could be (i) revertants to a lig+
phenotype; (ii) second site revertants that increase expression of
the defective E. coli ligase gene product; (iii) a cloned piece of T.
aquaticus DNA that increases expression of the defective E. coli
ligase gene product; or (iv) a cloned piece of T. aquaticus DNA
that contains the T. aquaticus ligase gene.

For the desired last alternative to work, it is necessary
that (i) the entire ligase gene is cloned; (ii) either the
endogenous control sequences for T. aquaticus ligase expression
function in E. coli, or exogenous vector control sequences are
sufficiently close to the amino terminus and the ligase gene is
cloned in the correct orientation to allow for proper expression in
E. coli; (iii) the T. aquaticus ribosome binding site works in E.
coli; and (iv) the T. aquaticus ligase is active enough at 42°C, and
the amount synthesized is sufficient to complement ligase
function in E. coli without interfering with other processes.
Construction of the suitable libraries used in the present
invention utilized conventional vectors containing desired control
sequences, and standard restriction endonuclease and ligation
techniques. Purified plasmid DNA, T. aquaticus DNA sequences, or
synthesized oligonucleotides for use in the present invention,
were cleaved, tailored, and religated in the form desired also by
conventional techniques.

The selection of a suitable vector for use in the present
invention is more than a mere matter of selecting a vector among
the many which exist and have been used in the past. High copy
number derivatives of pUC plasmids [see for example, C.
Yanisch-Perron et al., Gene 33:103 (1985), or J. Vieira et al., Gene
19:259 (1982)] are actually somewhat unstable when grown at 42°C.
Low copy plasmids such as pBR322 derivatives pFBI 1, 2, 13, 14 and
15 [see F. Barany, Proc. Natl. Acad. Sci. USA 82:4202 (1985)] may
not produce enough enzyme to complement the ligase defect. In
making the present invention, 18 different libraries using 3
different sets of vectors were constructed. The successful clone
was derived from the vector pTZ18R [see D. A. Mead et al., Protein
Engineering 1:67 (1986)], although other vectors may also be
utilizable.

Generally, site-specific DNA cleavage, as more particularly
described in the following example, is performed by treating the
DNA with a suitable restriction enzyme under conditions which
are generally understood in the art, and the particulars of which
are specified by the manufacturers of these commercially
available restriction enzymes. In general, about 1 µg of plasmid
or DNA sequence is cleaved by two to ten units of enzyme in about
20 µl of buffer solution. Incubation times of about one to two
hours at about 37°C are preferable, although variations in both
the time and temperature can be tolerated. After each incubation,
protein is removed by extraction with phenol/chloroform, and may
be followed by a further extraction. The nucleic acids are
recovered by precipitation with ethanol. If desired, size
separations of the cleaved fragments may be performed by
polyacrylamide or agarose gel electrophoresis using standard
techniques.

EXAMPLE II
(site specific cleavage)
Site-specific cleavage of both plasmid and T. aquaticus
DNA was performed using commercially available restriction
endonucleases in standard buffers.

In general, about 10 µg of plasmid or T. aquaticus DNA was
cleaved in 100 µl of buffer solution by adding 20 to 100
units of the appropriate restriction endonuclease and incubating
the mixture at 37°C for 1 to 2 hrs.
After each incubation, protein was removed by sequential
extractions with phenol (2x) and n-butanol (2x), and the nucleic
acid was recovered by precipitation with ethanol.

Construction of suitable vectors containing the desired
coding and control sequences employs conventional ligation and
restriction techniques. Briefly, isolated plasmids, DNA
sequences, or synthesized oligonucleotides are cleaved, tailored,
and religated in the form desired.

The restriction endonucleases utilized for cleavage of the
specific libraries used in accordance with the procedure outlined
in Example II were BamHI, SacI, KpnI (Asp718), PstI, HindIII, and
SmaI; however, other endonucleases or partial digests with
Sau3A, for example, could have been used. Due to adenosine
methylation, the commonly utilized restriction endonucleases
EcoRI, SalI or XhoI were not used since DNA from T. aquaticus
strain HB8 could not be cleaved by these enzymes.

Restriction fragments resulting from the procedure outlined
in Example II containing 5' overhangs may be blunt ended by filling
in with DNA polymerase I large (Klenow fragment) in the presence
of the four deoxynucleotide triphosphates using incubation times
of about 15 to 30 minutes at 37°C in 50 mM Tris pH 7.6 buffer
containing 50 mM NaCl, 10 mM MgCl2, 10 mM DTT, and 50-100
µM deoxynucleotide triphosphates. The Klenow fragment will fill
in at 5' sticky ends. If 3' overhangs are generated, they may be
chewed back with mung bean nuclease. After treatment with
Klenow, the mixture is extracted with phenol/chloroform and
precipitated with ethanol. Subsequent treatment under
appropriate conditions with S1 nuclease results in hydrolysis of
any single stranded portion. These conventional procedures may
be used for cloning any fragment into a (blunt end) site within the
vector.

EXAMPLE III
(vector construction)
In vector constructions, the linearized vector is commonly
treated with a phosphatase enzyme (or alternatively with a
second nearby restriction endonuclease) to prevent
recircularization of the vector in the absence of insert DNA. For
example, a sample of BamHI (5' overhang) or SacI (3' overhang)
DNA (9 µg) in 150 µl 50 mM Tris HCl buffer at pH 8.0 and
containing 10 mM MgCl2 and 6 mM mercaptoethanol in the
presence of Na+ may be treated with Calf Intestine Alkaline
Phosphatase (CIAP, 22 units) at 37°C for 15 min., followed by
incubation at 50°C for 30 min. to remove phosphate groups from
either 5' or 3' overhangs. Alternatively, Bacterial Alkaline
Phosphatase (BAP, 10 units) may be used in 150 µl 10 mM Tris HCl
in the presence of Na+ and Mg++ and incubating at 60°C for about
1 hr. CIAP may be subsequently denatured by the addition of EDTA
and EGTA to chelate divalent cations, and heating to 65°C for 15
min. Either CIAP or BAP protein is then removed by sequential
extractions with phenol (2x) and n-butanol (2x), and nucleic acid
recovered by precipitation with ethanol.

The effectiveness of the phosphatase step is assayed by
comparison of the number of transformants generated when
vector is religated in the absence or presence of insert DNA.
A typical result of 10 to 100 fold more transformants
when insert DNA is present indicates that the vector DNA has
been properly phosphatased.

EXAMPLE IV
(ligations)

Ligations were performed in 30-100 µl volumes using 1-2
µg linearized and phosphatased vector made as previously
described and 2-4 µg T. aquaticus DNA cut with a restriction
endonuclease generating the same ends as the vector, in 50 mM
Tris HCl buffer at pH 8.0 and containing 10 mM MgCl2, 1 mM EDTA,
1 mM ATP, 6 mM mercaptoethanol and from 3 to 7 (Weiss) units of
T4 ligase, by incubating at either 4 or 15°C overnight. After
ligation, EDTA was added, the T4 ligase inactivated by heating the
solution to 65°C for 15 min., and nucleic acids recovered by
ethanol precipitation.

Ligation mixtures were introduced into a suitable host such
as E. coli strains RR1, AK53 or AK76 - the last one suitable for
immediate positive selection of the lig+ phenotype - via
conventional transformation procedures [see Hanahan, J. Mol. Biol.
166:3243 (1987)]. Transformants were selected by plating on
ampicillin (or other drugs such as tetracycline or kanamycin
depending upon the plasmid used) containing plates. For positive
selection of the lig+ phenotype, AK76 transformants were plated
onto SOB plates (made by autoclaving 20 g Bacto™-tryptone, 5 g
Bacto™-yeast extract, 0.5 g NaCl, 16 g Bacto™-agar in 1 liter of
distilled water adjusted to pH 7.5 with NaOH prior to autoclaving,
then adding 20 ml 1 M MgSO4) containing 0.2% maltose, 0.2 mg/ml
IPTG (to induce the lac promoter), and 50 µg/ml ampicillin (to
select the plasmid-containing cells), and grown overnight at 42°C
to 42.5°C.

Libraries ranged in size from about 5,000 to 27,000 clones.
Given the general estimate that the bacterial chromosome
contains about 2,000 to 4,000 kilobases, and the average insert
consisted of 5 to 10 kb, it was apparent that several libraries
contained redundant clones.
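
The redundancy argument can be made concrete with a small
calculation. The sketch below uses the commonly applied Clarke and
Carbon estimate, P = 1 - (1 - f)^N, where f is the insert size
divided by the genome size and N is the number of clones. This
formula is not stated in the text and is introduced here only as an
assumed, standard approximation; the function names are hypothetical.

```python
import math


def fold_coverage(num_clones, insert_kb, genome_kb):
    """Average number of times each genomic position is represented."""
    return num_clones * insert_kb / genome_kb


def probability_represented(num_clones, insert_kb, genome_kb):
    """Clarke-Carbon estimate P = 1 - (1 - f)**N with f = insert/genome."""
    fraction = insert_kb / genome_kb
    return 1.0 - (1.0 - fraction) ** num_clones


def clones_needed(probability, insert_kb, genome_kb):
    """Smallest N that reaches at least the requested probability."""
    fraction = insert_kb / genome_kb
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - fraction))


# Least favourable combination of the figures quoted in the text:
# smallest library, smallest insert, largest chromosome.
worst_case_coverage = fold_coverage(5000, 5, 4000)
worst_case_probability = probability_represented(5000, 5, 4000)
```

Even for the least favourable combination of the quoted figures
(5,000 clones, 5 kb inserts, a 4,000 kb chromosome) the estimated
coverage is several fold, which is consistent with the statement
that the libraries contained redundant clones.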

Mixed plasmid preparations were made from six libraries
using conventional techniques [see Methods Enzymol. 100:243
(1983)], and introduced into fresh AK76 cells. Transformants
from each library were plated on 6 SOB plates (each plate
receiving between 30,000 and 70,000 clones) and incubated at
42°C. One library produced from 11 to 19 exceedingly small
colonies per plate; the remaining libraries produced an occasional
large colony.

Individual clones were picked, plasmid DNA prepared using
conventional techniques [see Anal. Biochem. 114:193 (1981)], and
analyzed by restriction digestion. All 12 small clones produced a
6.8 kb plasmid containing two BamHI fragments (1.8 and 2.1 kb
respectively) cloned within the BamHI site of pTZ18R. One such
plasmid has been designated pDZ1 as depicted in Figure 1. By
calculating back to the original library (of 5,200 clones), it
appears that all pDZ1 plasmids derived from a single clone. The
large colonies contained plasmids close to the size of the original
vector. Therefore, these large colonies are probably revertants of
the chromosomal ligts7 gene which contained any plasmid solely
to confer resistance to ampicillin.

Retransforming plasmid pDZ1 into AK76 cells, and selecting
at 42°C on SOB plates containing maltose, IPTG, and ampicillin as
described in Example IV, again yielded small colonies. Plating
fresh transformants on tryptone yeast agar containing ampicillin
did not produce colonies. This result suggests that induction of
the lac promoter during plasmid establishment is necessary for
production of sufficient quantities of T. aquaticus ligase to
complement the genetic defect. Once the plasmid has become
established in AK76 cells, such clones will give exceedingly
small colonies when streaked and allowed to grow on tryptone
yeast plates containing ampicillin at 42°C.

Digestion of pDZ1 with BamHI followed by religation would
scramble the fragments. Transformation of such a ligation mix
into AK76, followed by plating at 37°C, i.e. under non-selective
conditions, compared to plating at 42°C, i.e. under selective
conditions, yielded 1,000 fold more colonies under non-selective
conditions. The starting pDZ1 plasmid yielded only 2 fold more
colonies under non-selective than selective conditions. This
finding strongly suggests that the presence of both fragments,
and the orientation in which they are cloned, are necessary for
proper expression of T. aquaticus ligase.

Although pDZ1 contains several SacI and SmaI sites, it only
contains a single (vector derived) PstI, KpnI, or HindIII site. Thus,
it would have been expected that a number of ligase clones would
have been isolated from the PstI, KpnI, or HindIII digest libraries.
However, the only ligase clone was derived from the partial
BamHI digest library. Although it is not clear why this happened,
one conceivable explanation is that other clones did not bring the
lac promoter controlling element sufficiently close to the start
of the ligase gene to adequately express the ligase protein during
plasmid establishment.

The cloning of T. aquaticus ligase as described above will
now enable those skilled in the art to clone any thermophilic or
thermostable ligase, whether of procaryotic, archebacterial,
eukaryotic or phage origin by additional approaches. Accordingly
the cloning of such ligases is within the scope of the present
invention.

Such additional approaches to cloning may include, for
example, (i) cloning T. aquaticus DNA into a red- lambda vector
and screening for the ability of recombinant phage lambda to form
plaques at 39°C on a ligts7 strain such as AK76 [essentially as
generally described in J. Mol. Biol. 132:471 (1979)]; (ii) use of the
lambda gt11 phage to express portions of the ligase gene, and
subsequently screening with antibodies raised to purified T.
aquaticus ligase - the positive lambda gt11 clone may then be
used to identify the full length gene by hybridization to other
plasmid or phage libraries, essentially as described in the cloning
of T. aquaticus polymerase [see J. Biol. Chem. 264:6427 (1989)];
(iii) based upon the ligase DNA sequence, probes can be made
that would hybridize to and therefore help to identify and retrieve
other thermostable ligase encoding sequences in a variety of
species. Accordingly, portions of the DNA encoding at least five
amino acids from T. aquaticus ligase can be replicated, or
amplified using PCR techniques, and the denatured or single
stranded forms may be used as probes to retrieve additional DNAs
encoding a thermophilic or thermostable ligase. Alternatively,
oligodeoxyribonucleotide probes can be synthesized which encode
at least five amino acids, and these may be used to retrieve
additional DNAs encoding a thermophilic or thermostable ligase.

The selection of a portion of DNA encoding for at least five
amino acids is based upon the portion containing fifteen nucleic
acid bases, which is more than the statistical minimum length
that an oligonucleotide should have in order to find a single
complementary sequence in a genome. However, portions slightly
smaller (the minimum number in E. coli is, for example, 12,
indicating a portion as small as that encoding for four amino
acids may be acceptable) or larger (the minimum number for
higher animals is as high as 19, indicating that a portion encoding
for at least seven amino acids may be necessary) [see
Oligonucleotides: Antisense Inhibitors of Gene Expression, vol. 12,
pages 137 - 140, Macmillan Press Ltd., London (1989)] may be
used to obtain similar results. However, because there may not be
a precise match between the nucleotide sequence in the
corresponding portions between species, oligomers containing
approximately 15 nucleotides are a preferred minimum in order to
achieve hybridization under conditions of sufficient stringency to
eliminate false positives; the sequence encoding 5 amino acids
would supply information sufficient for the generation of such
probes.
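
The relationship between genome size, minimum oligonucleotide
length and the number of encoded amino acids can be sketched as
follows. The criterion used (the shortest length n for which 4^n
exceeds the number of positions on both strands) and the E. coli
genome size of about 4.6 million base pairs are assumptions made for
illustration; the cited reference may apply a different statistical
criterion, so the figure quoted for higher animals is not
necessarily reproduced by this simple model.

```python
import math


def minimum_unique_length(genome_size_bp, both_strands=True):
    """Shortest oligo length n with 4**n greater than the search space.

    Assumption: a random-sequence model in which an n-mer is expected
    to be unique once 4**n exceeds the number of possible start sites.
    """
    search_space = genome_size_bp * (2 if both_strands else 1)
    length = 1
    while 4 ** length <= search_space:
        length += 1
    return length


def codons_needed(oligo_length):
    """Number of amino acid codons required to span the oligo."""
    return math.ceil(oligo_length / 3)


# Assumed E. coli genome size (not given in the text).
E_COLI_GENOME_BP = 4_600_000
e_coli_minimum = minimum_unique_length(E_COLI_GENOME_BP)
```

The codon arithmetic agrees with the text: 12 bases correspond to
four amino acids, 15 bases to five, and 19 bases require seven.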

By way of example, a comparison of the T. aquaticus ligase
and E. coli amino acid sequences reveals an identity between
amino acids 34-40 (Asp-Ala-Glu-Tyr-Asp-Arg-Leu) at
statistically acceptable levels. Using the preferred six amino
acid sequence, a degenerate probe of the form GA(C/T)-
GC(G/A/T/C)-GA(G/A)-TA(C/T)-GA(C/T)-(C/A)G(G/A/T/C)-(C/T)T
could be used to identify and retrieve either of the above ligases.
The areas of sequence identities between the Thermophilus
ligase according to the present invention and E. coli ligase
include the amino acids at the following positions:



Amino Acid Positions     Consecutive identical aa's

34 to 40                  7
57 to 61                  5
137 to 142                6
168 to 175                8
199 to 210               12
212 to 219                8
308 to 312                5
333 to 339                7
485 to 490                6
492 to 496                5
513 to 517                5
620 to 624                5



Overall, of the 676 amino acids contained in the ligase
according to the present invention, the percent similarity
between the Thermophilus ligase and E. coli ligase is 66%; the
percent identity is 47%.
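
The degenerate probe quoted above for amino acids 34-40 can be
examined with a short script. The sketch below parses the probe
notation, counts how many distinct oligonucleotides it represents,
and checks that the codon run GAC GCC GAG TAC GAC CGG CT, which
appears in the partial ligase sequence given later in this
specification, is among them. The parsing helper and its names are
hypothetical and serve only as an illustration.

```python
import re
from itertools import product

# Degenerate probe exactly as written in the text.
PROBE = "GA(C/T)-GC(G/A/T/C)-GA(G/A)-TA(C/T)-GA(C/T)-(C/A)G(G/A/T/C)-(C/T)T"


def parse_probe(probe):
    """Return one tuple of allowed bases per probe position."""
    tokens = re.findall(r"\(([ACGT/]+)\)|([ACGT])", probe)
    return [
        tuple(group.split("/")) if group else (base,)
        for group, base in tokens
    ]


def degeneracy(positions):
    """Number of distinct sequences represented by the probe."""
    total = 1
    for choices in positions:
        total *= len(choices)
    return total


def expand(positions):
    """List every concrete oligonucleotide covered by the probe."""
    return ["".join(bases) for bases in product(*positions)]


positions = parse_probe(PROBE)
probe_length = len(positions)
probe_degeneracy = degeneracy(positions)
matches_listing = "GACGCCGAGTACGACCGGCT" in expand(positions)
```

Read in this way, the probe is 20 nucleotides long and stands for a
mixture of 1,024 sequences, one of which corresponds to the listed
T. aquaticus codons.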

The construction of an overproducer strain from a cloned
and properly oriented gene may be achieved by using procedures
which are conventional in the art. The general principle of such
construction is to bring an enabling sequence into close proximity
to the starting codon of the gene to effect efficient transcription
and translation of that gene. There are many promoter systems
(including a ribosome binding site [see Proc. Natl. Acad. Sci. USA
78:5543 (1981)]) that have been successfully used to turn on
genes, including the lac promoter, the trp promoter [see Gene
20:231 (1982)], the lambda phage PL promoter [see Nature
292:128 (1981)], the tac fusion promoter [see Proc. Natl. Acad.
Sci. USA 80:21 (1983)], and the T7 phage promoters [see Proc.
Natl. Acad. Sci. USA 82:1074 (1985)].

Plasmid pDZ1 contains the T. aquaticus ligase gene
downstream from both lac and T7 promoters present in the
starting vector. There are several methods for removing excess
DNA sequences from between the promoters and the gene,
including use of Bal31 [see Nucl. Acids Res. 5:1445 (1978)] and
ExoIII and Mung Bean or S1 Nuclease [see Meth. Enzymol. 155:156
(1987)]. However, a somewhat simpler method as described in
Example V was used to bring the amino terminus of the T.
aquaticus ligase gene closer to the two promoters in the present
instance.

EXAMPLE V
(removal of excess DNA from between promoter and gene)
Plasmid pDZ1 was randomly linearized with the restriction
endonuclease HinPI (G CGC) and blunt ended with Klenow or
alternatively with CviJI (PuG CPy) [see DNA and Protein
Engineering Techniques 1:29 (1988)].

DNA was purified by sequential extractions with phenol
(2x) and n-butanol (2x), and the nucleic acid recovered by
precipitation with ethanol. These randomly linearized plasmids
were then treated with Asp718, which cleaves the polylinker site
directly downstream of the two promoters, and blunt ended with
Klenow. The resulting fragments were separated via
electrophoresis in low melting agarose, sequential slices
(including full length linear and progressively smaller DNA
fragments) excised, and the DNA recovered. The DNA fragments
were subsequently recircularized by blunt end ligation. This
involved overnight incubation at 4°C in 100 µl of 50 mM Tris HCl
pH 8.0 buffer containing 10 mM MgCl2, 1 mM EDTA, 1 mM ATP, 6
mM mercaptoethanol, and from 3 to 7 Weiss units of T4 ligase.
After ligations, EDTA was added, the T4 ligase inactivated by
heat (for 15 min at 65°C), and nucleic acids recovered by ethanol
precipitation.

The ligation mixes prepared were introduced into AK76
cells using conventional techniques, and the lig+ phenotype was
selected at 42°C on SOB plates containing maltose, IPTG, and
ampicillin as described previously.

Based upon previous work, plasmids containing deletions
between the promoters and the start of the T. aquaticus ligase
gene would be expected to confer viability under these conditions.
Deletions of the vector (promoter regions), or of an essential
portion of the ligase gene should not confer viability. Therefore,
individual clones were picked, plasmid DNA prepared using
conventional methods [see Anal. Biochem. 114:193 (1981)], and
analyzed by restriction digestion. Results from this testing found
that plasmid pDZ2, pDZ3, pDZ6 and pDZ7 lacked the 1.8 kb BamHI
fragment, and contained instead a 1.3, 1.4, 1.2, or 1.2 kb fragment,
respectively. All these plasmids re-created the Asp718 site as
would be expected with proper blunt end fill-ins and ligations.
Single stranded DNA was prepared from these plasmids using
conventional techniques [see Nucl. Acids Research 13:1103
(1985), and Protein Engineering 1:64 (1986)], and these were
sequenced using the universal "reverse primer" oligonucleotide
5'd(AGCGGATAACAATTTCACACAGGA)3' and T7 DNA polymerase
[see Proc. Natl. Acad. Sci. USA 84:4767 (1987)].

Analysis of the DNA sequence reveals two ATG start codons,
the first open reading frame being three codons in length and the
second, the ligase DNA sequence, giving a long reading frame. In
conjunction with Figure 1, this sequence (including the partial
ligase DNA sequence) derived from plasmids pDZ6 and pDZ7 is:

pTZ18R

GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT 

lacZ' T7 Promoter
CACACAGGAA ACAGCTATGA CCATGATTAC GAATTTAATA 

pDZ6,7 

CGACTCACTA TAGGGMEEC ^(TrTCGGTA QC CCAAGGTA 
EcoRI SacI KpnI

CACIAGGQOC 



thermophilic ligase 





ACT 


CTG 


GAA 


GAG 


GOG 


AGG 


AAG 


CGG 


GTA 


AAC 


GAG 


TTA 


CGG 


GAC 




ATT 


CGC 


TAC 


CAC 


AAC 


TAC 


CGC 


TAC 


TAC 


GTC 


CTG 


GCG 


GAC 


COG 


GAG 


ATC 


TCC 


GAC 


GCC 


GAG 


TAC 


GAC 


CGG 


CTT 


CTT 


AGG 


GAG 


CTC 


AAG 


GAG 


CTT 


GAG 


GAG 


CGC 


TTC 


CCC 


GAG 


CTC 


AAA 


AGO 


COG 


GAC 


TCC 


CCC 


ACC 


CTT 


CAG 


GTG 


GGG 


GOG 


AGG 


OCT 


TTG 


GAG 


GCC 


ACC 


TTC 


CGC 


CCC 


GTC 


CGC 


CAC 


CCC 


ACC 


CGC 


ATG 


TAC 


TCC 


TTG 


GAC 


AAC 


GCC 


TTT 


AAC 


CTT 


GAC 


GAG 


CTC 


AAG 


GCC 


TTT 


GAG 


GAG 


CGG 


ATA 


GAA 


CGG 


GCC 


CTG 


QGG 


CGG 


AAG 


GGC 


CCC 


TTC 


GCC 


TAC 


ACC 


GTG 


GAG 


CAC 


AAG 


GTG 


GAC 


GOG 


CTT 


TCC 


GTG 


AAC 


CTC 


TAC 


TAC 


GAG 


GAG 


GGG 


GTC 


CTG 


GTC 


TAC 


GGG 


GCC 


ACC 


GCC 


GGG 


GAC 


GGG 


GAG 


GTG 


GGG 


GAG 


GAG 


GTC 


ACC 


CAG 


AAC 


CTC 


CTC 


ACC 


ATC 


CCC 


ACC 


ATC 


COG 


AGG 


AGG 


CTC 


AAG 


GGG 


GTG 


COG 


GAG 


CGC 


CTC 


GAG 


GTC 


CGG 


GGG 


GAG 


GTC 


TAC 


ATG 


CCC 


ATA 


GAG 


GCC 


TTC 


CTC 


CGG 


CTC 


AAC 


GAG 


GAG 


CTG 


GAG 


GAG 


CGG 


GGG 


GAG 


AGG 


ATC 


TTC 


AAA 


AAC 


OCT 


AGG 


AAT 


GOG 


GCG 


GCG 


GGT 


TOO 


TTA 


AGG 


CAA 


AAA 


GAC 


CCC 


CGC 


ATC 


ACC 


GCC 


AAG 


CGG 


GGC 


CTC 


AGG 


GCC 


ACC 


TTC 


TAC 


GCC 


TTA 


GGG 


CTT 


GGG 


CTG 


GAG 


GAG 


GTG 


GAG 


AGG 


GAA 


GGG 


GTG 


GOG 


ACC 


CAG 


TTT 


GCC 


CTC 


CTC 


CAC 


TGG 


CTC 


AAG 


GAA 


AAA 


GGC 


TTC 


OOC 


GTG 


GAG 


CAC 


GGC 


TAC 


GCC 


CGG 


GCC 


GTG 


GGG 


GCG 


GAA 


GGG 


GTG 


GAG 


GOG 


GTC 


TAC 


CAG 


GAC 


TGG 


CTC 


AAG 


AAG 


CGG 


CGG 


GOG 


CTT 


CCC 


ttt 


GAG 


GOG 


GAC 


GGG 


GTG 


GTG 


GTG 


AAG 


CTG 


GAC 


GAG 


CTT 


GCC 


CTT 


TOG 


CGG 


GAG 


CTC 


GGC 


TAC 


ACC 


GCC 


CGC 


GCC 


CCC 


CGG 


TTC 


GCC 


ATC 


GCC 


TAC 


AAG 


TTC 


CCC 


GCC 


GAG 


GAG 


AAG 


GAG 


ACC 


CGG 


CTT 


TTG 


GAC 


GTG 


GTC 


TTC 


CAG 


GTG 


GGG 


CGC 


ACC 


GGG 


CGG 


GTG 


ACC 


OOC 


GTG 


GGG 


ATC 


CTC 


GAG 


CCC 


GTC 


TTC 


CTA 


GAG 


GGC 


AGO 


GAG 


GTC 


TCC 


CGG 


GTC 


ACC 


CTG 


CAC 


AAC 


GAG 


AGO 


TAC 


ATA 


GAG 


GAG 


TTG 


GAC 


ATC 


CGC 


ATC 


GGG 


GAC 


TGG 


GTT 


TTG 


GTG 


CAC 


AAG 


GCG 


GGC 


GGG 


GTC 


ATC 


CCC 


GAG 
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GTC 


CTC CGG 


GTC CTC AAG GAG 


AGG 


CGC ACG 


GGG 


GAG 


GAA 


AGG 


CCC 


ATT 


CGC TGG 


QCC GAG AOC TGC 


COC 


GAG TGC 


GGC 


CAC 


CGC 


CTC 


CTC 


AAG 


GAG GGG 


AAG GTC CAC CGC 


TGC 


COC AAC 


COC 


TTG 


TGC 


COC 


GOC 


AAG 


GQC TTT 


GAG GOC ATC CGC 


CAC 


TTC GOC 


TOO 


CGC 


AAG 


GOC 


ATG 


GAC 


ATC CAG 


GGC CTG GGG GAA 


AAG 


CTC ATT 


GAG 


AGG 


CTT 


TTG 


GAA 


AAG 


GGG CTG 


GTC AAG GAC GTG 


GOC 


GAC CTC 


TAC 


CGC 


TTG 


AGA 


AAG 


GAA 


GAC CTG 


GTG GGC CTG GAG 


CGC 


ATG GGG 


GAG 


AAG 


AGO 


GOC 


CAA 


AAC 


CTC CTC 


CGC GAG ATA GAG 


GAG 


AGO AAG 


AAA 


AGA 


GGC 


CTG 


GAG 


CGC 


CTC CTC 


TAC GOC TTG GGG 


CTT 


COC GGG 


GTG 


GGG 


GAG 


GTC 


TTG 


QCC 


CGG AAC 


CTG GOG GOC CGC 


TTC 


GGG AAC 


ATG 


GAC 


CGC 


CTC 


CTC 


GAG 


GOC AGO 


CTG GAG GAG CTC 


CTG 


GAG GTG 


GAG 


GAG 


GTG 


GGG 


GAG 


crc 


ACG GOG 


AGG GOC ATC CTG 


GAG 


AOC TTG 


AAG 


GAC 


COC 


GOC 


TTC 


CX3C 


GAC CTG 


GTA COG AGG CTC 


AAG 


GAG GOG 


GGG 


GTG 


GAG 


ATG 


GAG 


GOC 


AAG GAG 


AAG GGC GGG GAG 


QCC 


CTT AAA 


GGG 


CTC 


AOC 


TOC 


GTG 


ATC 


AOC GGG 


GAG CTT TOO CGC 


COC 


CGG GAA 


GAG 


GTG 


AAG 


GOC 


CTC 


CTA 


AGG CGC 


CTC GGG GOC AAG 


GTG 


ACG GAC 


TOC 


GTG 


AGO 


CGG 


AAG 


ACG 


AGO TAC 


CTC GTG GTG GGG 


GAG 


AAC COG 


GGG 


GAG 


AAC 


COG 


GGG 


AGO 


AAG CTG 


GAG AAG GOC AGG 


GOC 


CTC GGG 


GTC 


COC 


ACC 


CTC 


ACG 


GAG 


GAG GAG 


CTC TAC CGG CTC 


CTG 


GAG GOG 


CGG 


ACG 


GGG 


AAG 


AAG 



GCG GAG GAG CTC GTC TAA AGGCTTCC

The nucleic acid sequence for the thermophilic ligase 
according to the present invention corresponds to the amino acid 
sequence: 

Met Thr Leu Glu Glu Ala Arg Lys Arg Val Asn Glu Leu Arg Asp
Leu Ile Arg Tyr His Asn Tyr Arg Tyr Tyr Val Leu Ala Asp Pro
Glu Ile Ser Asp Ala Glu Tyr Asp Arg Leu Leu Arg Glu Leu Lys
Glu Leu Glu Glu Arg Phe Pro Glu Leu Lys Ser Pro Asp Ser Pro
Thr Leu Gln Val Gly Ala Arg Pro Leu Glu Ala Thr Phe Arg Pro
Val Arg His Pro Thr Arg Met Tyr Ser Leu Asp Asn Ala Phe Asn
Leu Asp Glu Leu Lys Ala Phe Glu Glu Arg Ile Glu Arg Ala Leu
Gly Arg Lys Gly Pro Phe Ala Tyr Thr Val Glu His Lys Val Asp
Gly Leu Ser Val Asn Leu Tyr Tyr Glu Glu Gly Val Leu Val Tyr
Gly Ala Thr Arg Gly Glu Gly Glu Val Gly Glu Glu Val Thr Gln
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Asn Leu Leu 
Pro Glu Arg 
Ala Phe Leu 
lie Phe Lys 
5 Lys Asp Pro 
Tyr Ala Leu 
Ala Thr Gin 
Pro Val Glu 
Glu Ala Val 

1 0 Phe Glu Ala 

Try Arg Glu 
Ala Tyr Lys 
Val Val Phe 
lie Leu Glu 
15 Thr Leu His 
Gly Asp Trp 
Val Leu Arg 
lie Arg Trp 
Lys Glu Gly 

2 0 Lys Arg Phe 

Asp lie Gin 
Lys GLy Leu 
Glu Asp Leu 
Asn Leu Leu 

2 5 Arg leu Leu 

Ala Arg Asn 
Glu Ala Ser 
Leu Thr Ala 
Arg Asp Leu 

3 0 Ala Lys Glu 

He Thr Gly 
Leu Arg Arg 
Thr Ser Tyr 
Lys Ala Arg 



Thr He Pro Thr 
Leu Glu Val Arg 
Arg Leu Asn Glu 
Asn Pro Arg Asn 
Arg lie Thr Ala 
Gly Leu Gly Leu 
Phe Ala Leu Leu 
His Gly Tyr Ala 
Tyr Gin Asp Trp 
Asp Gly Val Val 
Leu Gly Tyr Thr 
Phe Pro Ala Glu 
Gin Val Gly Arg 
Pro Val Phe Leu 
Asn Glu Ser Tyr 
Val Leu Val His 
Val Leu Lys Glu 
Pro Glu Thr Cys 
Lys Val His Arg 
Glu Ala lie Arg 
Gly Leu Gly Glu 
Val Lys Asp Val 
Val Gly Leu Glu 
Arg Gin He Glu 
Tyr Ala Leu Gly 
Leu Ala Ala Arg 
Leu Glu Glu Leu 
Arg Ala He Leu 
Val Arg Arg Leu 
Lys Gly Gly Glu 
Glu Leu Ser Arg 
Leu Gly Ala Lys 
Leu Val Val Gly 
Ala Leu Gly Val 



He 


Pro Arg 


Arg 


Gly 


Glu Val 


Tyr 


Glu 


Leu Glu 


Glu 


Ala 


Ala Ala 


Gly 


Lys 


Arg Gly 


Leu 


Glu 


Glu Val 


Glu 


His 


Trp Leu 


Lys 


Arg 


Ala Val 


Gly 


Leu 


Lys Lys 


Arg 


Val 


Lys Leu 


Asp 


Ala 


Arg Ala 


Pro 


Glu 


Lys Glu 


Thr 


Thr 


Gly Arg 


Val 


Glu 


Gly Ser 


Glu 


He 


Glu Glu 


Leu 


Lys 


Ala Gly 


Gly 


Arg 


Arg Thr 


Gly 


Pro 


Glu Cys 


Gly 


Cys 


Pro Asn 


Pro 


His 


Phe Ala 


Ser 


Lys 


Leu lie 


Glu 


Ala 


Asp Leu 


Tyr 


Arg 


M=st Gly 


Glu 


Glu 


Ser Lys 


Lys 


Leu 


Pro Gly 


Val 


Phe 


Gly Asn 


Met 


Leu 


Glu Val 


Glu 


Glu 


Thr Leu 


Lys 


Lys 


Glu Ala 


Gly 


Ala 


Leu Lys 


Gly 


Pro 


Arg Glu 


Glu 


Val 


Thr Asp 


Ser 


Glu 


Asn Pro 


Gly 


Pro 


Thr Leu 


Thr 



Leu 


Lys 


Gly 


val 


Met 


Pro 


lie 


Glu 


Arg 


Gly 


Glu 


Arg 


Ser 


Leu 


Arg 


Gin 


Arg 


Ala 


Thr 


Phe 


Arg 


/->» ~\ _ _ 
Glu 


Gly 


val 


Glu 


Lys 


r^i v* 

Gly 


Pne 


Ala 


Glu 


Gly 


Val 


Arg 


Ala 


Leu 


Pro 


Glu 


Leu 


Ala 


Leu 


Arg 


Phe 


Ala 


ILe 


Arg 


Leu 


Leu 


Asp 


Thr 


Pro 


vai 


biy 


Val 


Ser 


Arg 


vai 


Asp 


lie 


Arg 


lie 


Val 


lie 


Pro 


PT ii 

blU 


Glu 


Glu 


Arg 


Fro 


TT-5 r-i 

rilS 


Arg 


Leu 


T i^vi i 
I8U 


Leu 


Cys 


Pro 


7Vl -a 

Ala 


Arg 


Lys 


Ala 


JXBU 


Arg 


Leu 


Leu 


Glu 


Arg 


Leu 


Arg 


Lys 


Lys 


Ser 


Aia 


PI r\ 

bin 








VJT-LU 


pi tt 


pi n 

blU 


Vd_L 


icU 


Asp 


Arg 


Leu 


Leu 


Glu 


Val 


Gly 


Glu 


Asp 


Pro 


Ala 


Phe 


Val 


Glu 


Mst 


Glu 


Leu 


Thr 


Phe 


Val 


Val 


Lys 


Ala 


Leu 


Val 


Ser 


Arg 


Lys 


Ser 


Lys 


Leu 


Glu 


Glu 


Glu 


Glu 


Leu 
Arg Leu Leu Glu Ala Arg Thr Gly Lys Lys Ala Glu Glu Leu
Val

Translation of the first 60 amino acids of this open reading
frame (the thermophilic ligase) shows better than 50% homology
to E. coli ligase [see Mol. Gen. Genet. 204:1 (1986)], suggesting
that this long open reading frame represents the start of the T.
aquaticus gene. From the genetic results with the Bam HI
fragments, one can conclude that the size of this ligase is
between 400 and 1,100 amino acids in length. The purified
protein has been reported to have a molecular weight of about
79,000 [see J. Biol. Chem. 259:10041 (1984)], which is within the
limits of the genetic results found for the present invention.
Given that clone pDZ7 produces functional T. aquaticus ligase
(that is, it encodes the gene in its entirety), and given the DNA
sequence of the amino terminus, the entire DNA sequence of the
gene was determined using either manual or automated methods
as described in the literature [see, for example, Proc. Natl. Acad.
Sci. 84:4767 (1987); Proc. Natl. Acad. Sci. 86:4076 (1989);
Science 239:487 (1987); Nature 321:674 (1986); Biotechniques
8:184 (1990); Proc. Natl. Acad. Sci. USA 85:5610 (1988); and Proc.
Natl. Acad. Sci. USA 85:9436 (1988)].
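
The consistency between the reported molecular weight and the genetic size limits can be checked with a short calculation. The sketch below assumes an average residue mass of 110 daltons, a common rule of thumb that is not stated in this specification; the function names are illustrative only.

```python
# Rough check: does a protein of about 79,000 daltons fall inside the
# 400 to 1,100 residue window inferred from the Bam HI fragment genetics?
AVERAGE_RESIDUE_MASS_DA = 110.0  # assumed rule-of-thumb value, not from the source


def estimated_residue_count(molecular_weight_da, residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Estimate chain length from molecular weight using an average residue mass."""
    return round(molecular_weight_da / residue_mass_da)


def within_genetic_limits(residues, low=400, high=1100):
    """True when the estimated length lies inside the genetically derived window."""
    return low <= residues <= high


residues = estimated_residue_count(79_000)
print(residues, within_genetic_limits(residues))
```

Under that assumption the estimate is roughly 700 residues, comfortably inside the stated window.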

Plasmids pDZ2, pDZ3, pDZ6 or pDZ7 may be used to construct
further overproduction vectors using methods common to those
skilled in biotechnology studies. This may include using
promoters and ribosome binding sites as described above. For
example, plasmid pDZ7 (see Figure 1) may be linearized at its
unique Asp718 site, and excess nucleotides in front of the T.
aquaticus ligase gene trimmed close to the ATG start codon by
the use of Bal31 or a combination of ExoIII and Mung Bean or S1
Nuclease as described above. This may then be blunt end ligated
to a natural enabling sequence (a promoter and translation start
sequence) generated in a similar manner, or to a synthetic
enabling sequence manufactured for this purpose. In addition,
sequences external or internal to the T. aquaticus gene may be
modified to remove potential RNA structures that may inhibit
transcription or translation. These methods have been reported
previously to effect overproduction of the thermophilic
restriction endonuclease Taq I to greater than 30% of soluble E.
coli proteins [see Gene 65:166 (1988)]. Alternatively, synthetic
oligonucleotides may be synthesized such that the start of the T.
aquaticus ligase gene is fused directly to an enabling sequence
using PCR methods [see, for example, Biotechniques 8:178 (1990);
Gene 77:51 (1989); and Nucl. Acids Res. 17:723 (1989)].

From the preceding sequences, it can be seen that there is
a Bgl II site corresponding to the nucleotides that code for amino
acid residues 31-33. With this information, a strong promoter
with an optimal Shine-Dalgarno sequence could be inserted in
front of this gene using PCR. Two minor caveats need to be
considered: (1) attempts to PCR copy the entire gene (3 kb, high
GC content) were not always successful, and (2) plasmid pDZ7 had
two Bam HI and Bgl II sites, one each within the ligase gene.

Plasmid pDZ7 was partially digested with both Bam HI and
Bgl II, the correct size smaller linear fragment separated from
full length linear by electrophoresis, excised, and purified as
described previously. Since Bam HI and Bgl II produce the same
overhang (5' GATC), the linear fragment could be recircularized
with T4 ligase, and introduced into E. coli strain AK53 via
transformation. Several clones had deleted the 0.5 kb
Bam HI/Bgl II fragment resulting in a 5.7 kb plasmid, and one such
clone was designated pDZ12. Synthetic oligonucleotides #66,
#78, #85, and #94 were synthesized to allow for fusion of the pho A
promoter [from plasmid pFBT64; see Gene 56:13 (1987)] and
ribosome binding sequence to the start of the ligase gene using
PCR [see Biotechniques 8:178 (1990); Gene 77:51 (1989); Gene
77:61 (1989); and Nucl. Acids Res. 17:723 (1989)]. These clones
are depicted in Fig. 9, and are:

#66  19 mer; Pvu II site to T7 promoter through pho A
promoter, top strand of plasmid pFBT64 (direction of Taq I
endonuclease gene):

5' CTGGCTTATCGAAATTAAT 3'

#78  32 mer; 5' end complementary to start of
Thermus ligase gene; 3' end complementary to Shine-Dalgarno
side of pho A promoter, bottom strand of plasmid
pFBT64:

5' CCA GGG TCA TTT TAT TTT CTC CAT GTA CAA AT 3'

#85  33 mer; 5' end complementary to Shine-Dalgarno
side of pho A promoter; 3' end complementary to start of
Thermus ligase gene, top strand of plasmid pDZ7 (direction of
ligase gene):

5' CAT GGA GAA AAT AAA ATG ACC CTG GAA GAG GCG 3'

#94  18 mer; bottom strand of plasmid pDZ7
corresponding to non-translated strand of amino acid
residues 40 to 35 of ligase gene, downstream of Bgl II site
at amino acid residues 33 to 31:

5' AAG CCG GTC GTA CTC GGC 3'
Briefly, this was accomplished in a single reaction tube in
which 400 ng of primers #66 and #78 were added to 200 ng of
Pst I/Pvu II digested pFBT64 containing 50 µmoles of dATP, dCTP,
dGTP, and dTTP each, and 2.5 units AmpliTaq in 100 µl PCR buffer
and cycled at 94°C for 1 min, 55°C for 2 min, 72°C for 3 min with
3 sec extension per cycle for 25 cycles as per the manufacturer's
(Cetus, Emeryville, California) protocol. A second reaction tube
contained 400 ng of primers #85 and #94 and 200 ng of
Eco RI/Bam HI digested pDZ7 in the same reaction buffer and
enzyme, and was incubated as above. The products of these reactions
were shown to be the correct length as analyzed by gel
electrophoresis. A third reaction tube contained 2 µl from each
product and 400 ng primers #66 and #94 in the same reaction buffer
and enzyme, and was incubated as above. Primers were designed such
that overlap between the two products would allow for PCR
synthesis of the combined length fused product. The resultant
fragment was extracted with phenol and n-butanol, and ethanol
precipitated to remove Taq polymerase. The product PCR
fragment was treated with Bgl II and Eco RI, electrophoresed in
low melting agarose, and purified as described above. Meanwhile,
the 2.7 kb Pst I-Bgl II ligase gene containing fragment from
pDZ12 and the 2.4 kb Pst I-Eco RI β-lactamase gene and origin
containing fragment from pFBT64 were purified. All three
fragments were combined in a three way ligation and introduced
into E. coli strain AK53 via transformation. Several clones
contained a 5.5 kb plasmid which overproduced ligase under pho A
promoter control. One such plasmid has been designated pDZ13.
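
The primer design behind this fusion can be illustrated with a small check. The sketch below takes the sequences of oligonucleotides #78, #85 and #94 as listed above, measures how far the two inner primers overlap (the overlap is what lets the two first-round products prime each other in the third reaction), and translates the coding portions. The helper names and the partial codon table are illustrative assumptions; the reading frame for #85 is taken from the triplet grouping in the listing.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Partial codon table: only the codons that occur in the primers below.
CODON_TO_AA = {
    "ATG": "Met", "ACC": "Thr", "CTG": "Leu", "GAA": "Glu", "GAG": "Glu",
    "GCG": "Ala", "GCC": "Ala", "TAC": "Tyr", "GAC": "Asp", "CGG": "Arg",
    "CTT": "Leu",
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_overlap(left, right):
    """Length of the longest suffix of `left` that is also a prefix of `right`."""
    for n in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:n]):
            return n
    return 0


def translate(seq):
    """Translate complete codons of `seq` using the partial table above."""
    usable = len(seq) - len(seq) % 3
    return [CODON_TO_AA[seq[i:i + 3]] for i in range(0, usable, 3)]


primer_78 = "CCAGGGTCATTTTATTTTCTCCATGTACAAAT"   # bottom strand, 32 mer
primer_85 = "CATGGAGAAAATAAAATGACCCTGGAAGAGGCG"  # top strand, 33 mer
primer_94 = "AAGCCGGTCGTACTCGGC"                 # bottom strand, 18 mer

# Overlap between the top-strand reading of #78 and #85.
overlap = longest_overlap(reverse_complement(primer_78), primer_85)

# The ligase coding sequence starts at the sixth triplet of #85 (offset 15).
ligase_start = translate(primer_85[15:])

# Read on the top strand, #94 covers residues 35 to 40.
residues_35_to_40 = translate(reverse_complement(primer_94))

print(overlap, ligase_start, residues_35_to_40)
```

The translated start agrees with the first six residues of the amino acid sequence given earlier, and the #94 region agrees with residues 35 to 40 of that sequence.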
In reported studies in overproduction of the thermophilic
restriction endonuclease Taq I to greater than 30% of soluble E.
coli proteins [see Gene 65:166 (1988)], it was noticed that
endonuclease yields were somewhat better if the β-lactamase
gene was reversed, and hence transcribing in the opposite
direction as the pho A promoter. To make a similar construction
with the ligase gene according to the present invention, the 2.3 kb
Pst I-Pvu II fragment from plasmid pFBLT69 (which contains the
β-lactamase in reverse orientation) was ligated to the 3.2 kb
Pst I-Pvu II ligase gene containing fragment of plasmid pDZ13.
The ligation mix was transformed into E. coli strain AK53, and
several transformants were analyzed by restriction digests to
confirm the orientation of the β-lactamase gene. One such clone has
been designated pDZ15. Production of ligase in pDZ15 is as good
as, if not slightly better than, pDZ13. The ligase enzyme appears
to be somewhat sensitive to proteases, and the cells should be
grown for no more than 9 hours after induction. Proteolytic
products of the ligase gene may still have thermostable ligase
activity (this has been demonstrated for Taq polymerase).

Thermophilic proteins may be substantially modified and
still retain sufficient activity for use in the present invention.
For example, it has been shown that deletion of approximately
one-third of the coding sequence at the amino-terminus of Taq
polymerase still produces a gene product that is active in
polymerase activity [see J. Biol. Chem. 264:6427 (1989)].
Alternatively, another thermophilic protein, the restriction
endonuclease Taq I, was shown to retain essentially full activity
when amino acids were added to the amino-terminus (+7), the
carboxy-terminus (+38), or at certain positions internally (from
+2 to +34) [see Gene 65:166 (1988)]. Thus, modification of the
primary structure by deletion, N-terminus addition, C-terminus
addition, internal addition or duplication, or alteration of the
amino acids incorporated into the sequence during translation can
be made without destroying the activity or thermostable nature
of the protein. In addition, the availability of DNA encoding these
sequences provides the opportunity to modify the codon sequence
so as to generate mutein forms also having ligase activity. Such
substitutions or other alterations result in novel proteins having
amino acid sequence encoded by DNA falling within the scope of
the present invention.

It will also be appreciated that other ligating proteins may
be isolated by the process according to the present invention as
exemplified in these examples. Different cell lines may be
expected to produce ligases having different physical properties
to that isolated from the T. aquaticus HB8 strain used in the
making of the present invention. Additionally, variations may
exist due to genetic polymorphisms or cell-mediated
modifications of the enzyme or its precursors. Furthermore, the
amino acid sequence of a ligase so isolated may be modified by
genetic techniques to produce ligases with altered biological
activities and properties. The resultant DNA sequence may then
be able to encode a protein having substantially the same amino
acid sequence as T. aquaticus HB8 ligase, but exhibiting a higher
or lower level of activity. Such ligating proteins should also be
considered to be within the scope of the present invention.

EXAMPLE VI
(purification of ligase enzyme)

E. coli cells AK53 containing plasmids pDZ6 and pGP1-2
(containing the T7 RNA polymerase gene behind the lambda PL
promoter and under control of the temperature sensitive lambda
repressor cI857) [see Proc. Natl. Acad. Sci. USA 82:1074 (1985)
and United States Patent 4,795,699], were grown overnight at
32°C on TY plates containing ampicillin at 50 µg/ml and
kanamycin at 50 µg/ml to ensure maintenance of both plasmids.
Fresh colonies were resuspended in 1 liter of sterile 50 mM Tris
HCl buffer at pH 7.6 and containing 6 g NaCl, 25 g Bacto™-tryptone,
7.5 g yeast extract, 1 g glucose, 1.6 g casein amino acid
hydrolysate, 50 µg/ml kanamycin and 50 µg/ml ampicillin, and
grown at 32°C in a 2 liter flask shaking at 200 rpm. When the
O.D.550 reached between 0.8 and 1.0, synthesis of the T7
polymerase was induced by shifting the cells to 42°C for 30 to 40
minutes. Further synthesis of E. coli proteins was inhibited by
the addition of 5 ml of 20 mg/ml rifampicin dissolved in
methanol to a final concentration of 100 µg/ml. Under these
conditions, only genes behind the T7 promoter should be
transcribed and hence translated. Cells were incubated for an
additional 5 hours at 42°C.
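
The stated final rifampicin concentration follows directly from the stock concentration and the volumes given. A minimal sketch of that dilution arithmetic, assuming the 1 liter culture volume from the preceding text and that the added stock contributes to the final volume (the function name is illustrative):

```python
def final_concentration_ug_per_ml(stock_mg_per_ml, stock_volume_ml, culture_volume_ml):
    """Final concentration after adding a stock solution to a culture."""
    total_ug = stock_mg_per_ml * stock_volume_ml * 1000.0  # mg converted to µg
    return total_ug / (culture_volume_ml + stock_volume_ml)


# 5 ml of 20 mg/ml rifampicin added to a 1 liter culture.
rifampicin = final_concentration_ug_per_ml(20.0, 5.0, 1000.0)
print(round(rifampicin, 1))
```

The result is just under 100 µg/ml, in line with the nominal figure quoted above.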

Alternatively, E. coli cells AK53 containing plasmids
pDZ15 (ligase under pho A promoter control) were grown
overnight at 37°C on TY plates containing ampicillin at 50 µg/ml.
Fresh colonies were resuspended in 50 ml of fortified broth
containing 50 µg/ml ampicillin and grown at 37°C in a 500 ml
flask shaking at 200 rpm in a G76 benchtop shaker. When the
O.D.500 reached between 0.65 and 0.85, 20 ml was diluted into 1
liter of MOPS media containing 0.2 mM K2HPO4 [see J.
Bacteriology 119:736 (1974)] to induce the pho A promoter. Cells
were grown at 37°C in a 2 liter flask shaking at 200 rpm in a G25
floor shaker for an additional 9 hours.

Following incubation, the cells were chilled in ice,
harvested by centrifugation (5,000 rpm for 15 min), resuspended
in 20 ml of water, transferred to 35 ml centrifuge tubes,
recentrifuged (7,000 rpm for 6 min), and the pellet frozen until
ready for protein isolation. After thawing, the pellet was
resuspended in 20 ml of buffer A (20 mM Tris HCl buffer at pH 7.6
containing 1 mM EDTA) containing 10 mM 2-mercaptoethanol and
0.15 mM PMSF. After sonication (5 x 1 min at 50% power at 4°C),
the solution was centrifuged at 39,000 x g for 60 min.

The enzyme has an estimated molecular weight of from
75,000 to 85,000 daltons when compared with a phosphorylase B
standard assigned a molecular weight of 92,500 daltons.

Alternatively, 2 liters of pDZ15 induced cells were
harvested, sonicated, and debris cleared by centrifugation as
described above.

The supernatant (40 ml) was brought to 300 mM KCl and
passed through a 5 ml DEAE sephacel column to remove
extraneous DNA using 70 ml buffer A containing 0.3 M KCl. The
flowthrough fractions containing the ligase were combined, and
treated at 65°C for 20 minutes to irreversibly heat denature
many E. coli enzymes including endo or exonucleases. Denatured
proteins were then removed by centrifugation at 39,000 x g for
15 minutes, and the ligase enzyme precipitated from the
supernatant by adding an equal volume of saturated (NH4)2SO4 at
room temperature for 30 minutes. The ammonium sulfate
precipitate was harvested by centrifugation at 8,000 rpm in a
clinical centrifuge, and resuspended in 4 ml of distilled water.
Samples were dialyzed against buffer A, followed by buffer A
containing 50 mM KCl. The dialyzed protein solution was applied
to a 40 ml phosphocellulose column equilibrated with buffer A
containing 50 mM KCl. After washing with 80 ml of the same
buffer, the column was eluted with a 120 ml linear gradient of
KCl (0.05 to 0.5 M) in buffer A. The enzyme eluted as a sharp
peak from 0.25 to 0.35 M KCl. The protein migrates as two bands
of apparent molecular weight approximately 81,000 (adenylated
form) and 78,000 (non-adenylated form) and is about 98-99% pure
as monitored by SDS-10% polyacrylamide gel electrophoresis.
One can convert between the two forms by incubating 150 µg
protein in ligase buffer containing either 25 µg nicked salmon
sperm DNA without NAD (resulting in the non-adenylated form), or
in ligase buffer with 10 mM NAD (resulting in the adenylated
form) for 30 min at 65°C. An equal volume of 20 mM Tris HCl pH
8.0 in 100% glycerol containing 1 mM EDTA, 2 mM dithiothreitol
(DTT), and 200 µg/ml Bovine Serum Albumin (Fraction V) is added
(final glycerol concentration is 50%), and the enzyme stored at either
-70°C or -20°C. From 2 liters of cells, a final yield of 6 mg
ligase was obtained in 16 ml storage buffer, at 625 nick closing
units per microliter. This corresponds to a total of 10,000,000 units of
enzyme, and a specific activity of 1,666,667 units/mg.
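
The yield figures in the preceding paragraph can be reproduced from the stated volume, unit concentration and protein mass. The following sketch only restates that arithmetic; the function names are illustrative.

```python
def total_units(units_per_microliter, volume_ml):
    """Total nick closing units in a preparation of the given volume."""
    return units_per_microliter * volume_ml * 1000  # 1 ml = 1000 µl


def specific_activity(units, protein_mg):
    """Units of activity per milligram of protein."""
    return units / protein_mg


units = total_units(625, 16)            # 16 ml at 625 units per µl
activity = specific_activity(units, 6)  # 6 mg of ligase recovered
print(units, round(activity))
```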
Since it is known that thermophilic proteins tend to be
somewhat more hydrophobic than their mesophilic counterparts,
addition of non-ionic detergents or other stabilizing agents may
help in long term storage. Storage buffers may therefore include
additional components such as glycerol (50%), sucrose (25%),
protease inhibitors (0.5-1.0 mM PMSF, 10^-7 M pepstatin A), salt
(KCl, preferably at 100-500 mM), EDTA (0.1-1.0 mM), bovine serum
albumin (100-500 µg/ml), gelatin, dithiothreitol (1-10 mM), and
mercaptoethanol (1-10 mM). In addition, it is preferable that the
storage buffer contain at least one non-ionic polymeric detergent.
A partial listing of such detergents would include ethoxylated
fatty alcohol ethers and lauryl ethers, ethoxylated alkyl phenols,
polyethylene glycol monooleate compounds, and more particularly
Triton X-100, NP-40, and Tween 20 at 0.1-0.5% vol/vol.

To assay for ligase activity, it is important to use a method
that is not skewed by the melting temperature (Tm) of the
substrates. For example, a 4 base cohesive end ligation is most
efficient at a low temperature such as 4°C, well below the
temperature optimum for T4 ligase (which is 37°C), and certainly
below the temperature optimum of a thermophilic ligase. One
assay method that should be consistent is the nick-closing assay
in which circular plasmid DNA is randomly nicked in several
places by DNase I. The ability of ligase to close all these nicks
and generate covalently closed circular DNA can be assayed by
separating covalently closed circular DNA from nicked (open
circle) DNA via electrophoresis
in an agarose gel containing ethidium bromide. For example, the
covalently closed circular form of plasmid pUC4KIXX [see Gene
37:111 (1985)] migrates faster than the linear form, and
considerably faster than the nicked form on a 1% agarose gel
containing 0.2 M glycine NaOH pH 8.5, 0.1 mM EDTA, and 1 µg/ml
ethidium bromide and run at 150V for 1.5 hr in the same buffer.

EXAMPLE VII
(thermophilic ligase assay)

Nicked pUC4KIXX DNA was generated by adding 3 µl of
freshly diluted 1 ng/ml DNase I to 5 µg DNA in 50 µl of 50 mM
Tris HCl pH 8.0 buffer containing 10 mM MgCl2, 1 mM EDTA, and 6
mM mercaptoethanol. The mixture was incubated at room
temperature for 5 min, the DNase heat killed at 65°C for 10 min,
and the sample stored until used by freezing at -20°C. Under
these conditions, about 90% of the DNA was in the nicked circular
form, with about 5% in the linear and 5% in the covalently closed
circular form.

Thermophilic ligase prepared as above was assayed by
adding serial dilutions of ligase to 0.5 µg nicked pUC4KIXX in 20
µl of 20 mM Tris HCl pH 7.6 buffer containing 50 mM KCl, 10 mM
MgCl2, 1 mM EDTA, 10 mM NAD, 10 mM dithiothreitol, overlaying
with a drop of mineral oil, and incubating at 65°C for 15 min. As
a control, T4 ligase was assayed by adding serial dilutions of
ligase to 0.5 µg nicked pUC4KIXX in 20 µl of 50 mM Tris HCl pH
8.0 buffer containing 10 mM MgCl2, 1 mM EDTA, 1 mM ATP, 6 mM
mercaptoethanol, and incubating at 37°C for 15 min.

Reactions were terminated by the addition of 4 µl stop
buffer containing 0.2 M EDTA, 50% glycerol, 1% SDS and 0.1%
bromphenol blue, and the products were analyzed by gel
electrophoresis as described above.

One nick closing unit of ligase is defined as the amount of
ligase that circularizes 0.5 µg of nicked pUC4KIXX DNA under the
buffer and time conditions set forth in the preceding example,
such that addition of further ligase does not circularize
additional DNA.

As a mini-prep procedure, E. coli cells AK53 containing
plasmids pDZ15 (ligase under pho A promoter control) were grown
overnight at 37°C on TY plates containing ampicillin at 50 µg/ml.
Fresh colonies were resuspended in 5 ml of fortified broth
containing 50 µg/ml ampicillin, and grown at 37°C. When the
O.D.550 reached between 0.65 and 0.85, 0.12 ml was diluted into
6 ml of MOPS media containing 0.2 mM K2HPO4 to induce the pho
A promoter. Cells were incubated overnight at 37°C (some
proteolysis occurs after prolonged incubation, so caution is
advised against overgrowing induced cells). Cells were harvested in
1.5 ml microcentrifuge tubes, resuspended in 0.3 ml of 20 mM
Tris HCl pH 7.6 containing 1 mM EDTA and 10 mM 2-mercaptoethanol,
and sonicated 2 x 10 seconds. After clearing
debris by centrifugation (12,000 rpm for 2 min), the supernatant
was treated at 65°C for 20 min to irreversibly heat denature
many E. coli enzymes including the endo and exonucleases [see
Gene 56:13 (1987)]. The denatured debris was removed by
centrifugation and the supernatant assayed as described above.
One microliter of this supernatant contained approximately 625
nick closing units of activity.

The T. aquaticus ligase preparation described in the
preceding examples, as well as commercially available T4 ligase,
were shown to contain approximately 125 nick closing units per
microliter. Thus, from 1 liter of E. coli cells overproducing T.
aquaticus ligase, the process according to the present invention
has purified approximately (800 x 125) 100,000 nick closing
units of enzyme.

The thermophilic ligase prepared according to the preceding
description has a number of valuable properties which make it
especially useful in an assay that both amplifies DNA and
discriminates a single base substitution in a DNA sequence.
The single most important property of this ligase allowing for
these uses is that the ligase retains activity during repeated
thermal denaturation/renaturation cycles, thus allowing for the
amplification of DNA without necessitating repeated addition of
ligase. In addition, the ligase according to the present invention
will ligate oligonucleotides of a length which is sufficient to
assure their uniqueness in complex genomes at or near the Tm
temperature of 65°C, and will also accurately discriminate
between exactly complementary and single-base mismatched
oligonucleotide sequences.

In the simpler of the two procedures developed as a result
of cloning the thermophilic ligase DNA sequence, termed a ligase
detection reaction (LDR), two oligonucleotide probes are allowed
to hybridize to denatured DNA such that the 3' end of one is
immediately adjacent to the 5' end of the other. The
oligonucleotides are chosen to be sufficiently long (20 to 25
nucleotides) such that each will preferentially hybridize to its
unique position in the human genome. A thermophilic ligase can
then form a covalent phosphodiester bond between the two
oligonucleotides, provided that the nucleotides at the junction are
perfectly complementary to the target. The specificity of this
nick-closing reaction is particularly enhanced by virtue of
performing the ligation at or near the Tm of the two
oligonucleotides for their target. Thus, a single base mismatch at
the junction not only forms an imperfect double helix, but also
destabilizes the hybrid at the higher temperature. Consequently,
thermophilic ligase will efficiently link correctly base paired
oligonucleotides and give near zero background ligation in the
presence of the imperfectly matched sequences. Using LDR, the
amount of product obtained in the ligation reaction can be
increased in a linear fashion by repeated thermal cycling.

In the thermophilic ligase chain reaction according to the
present invention, both strands serve as targets for
oligonucleotide hybridization. By using an additional two
oligonucleotides complementary to the opposite strand, the
ligation products of one cycle become the targets for the next
cycle of ligation as generally depicted in figure 2. For each
adjacent oligonucleotide pair, the diagnostic nucleotide is on the
3' side of the junction. Thus, aberrant target independent ligation
of complementary oligonucleotides is avoided by use of
temperatures near the Tm, and by taking advantage of the poor
ligation efficiency of single base 3' overhangs. Using ligase chain
reaction, the amount of product can be increased in an exponential
fashion by repeated thermal cycling.
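
The difference between linear (LDR) and exponential (LCR) accumulation can be made concrete with an idealized model. The sketch below is an illustration only: it assumes a fixed per-cycle ligation efficiency and unlimited probe supply, neither of which is a figure taken from this specification, and the function names are hypothetical.

```python
def ldr_product(targets, cycles, efficiency=1.0):
    """Linear accumulation: only the original target strands act as templates."""
    return targets * efficiency * cycles


def lcr_product(targets, cycles, efficiency=1.0):
    """Exponential accumulation: ligated products also act as templates.

    Returns ligated product only, excluding the original target strands.
    """
    return targets * ((1.0 + efficiency) ** cycles - 1.0)


# Idealized comparison for 1,000 target molecules over 20 cycles.
print(ldr_product(1000, 20), lcr_product(1000, 20))
```

With perfect efficiency the LDR yield grows as 20 times the input, whereas the LCR yield grows by roughly a millionfold; real reactions would fall short of both idealized figures.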

In order to test the potential of the thermophilic ligase
chain reaction (LCR), the gene encoding human β globin was
selected as an initial model system. Previous work has determined
that the normal βA allele and sickle βS allele differ by a single
A->T transversion of the second nucleotide in the sixth codon of
the β globin gene, changing a glutamic acid residue into a valine
in the hemoglobin β chain according to the following Table I:

The following continuation of Table I presents the
oligonucleotide sequences listed in the preceding portion in their
conventional 5' --> 3' orientation:



Sequence   Sequence                                     size    Tm
no.        5' --> 3'                                    (mer)   (°C)

101        GT C ATG GTG CAC CTG ACT CCT GA               23      66
102        GTTT C ATG GTG CAC CTG ACT CCT GT             25      66
103        GTTTTT C ATG GTG CAC CTG ACT CCT GG           27      64
104        CT GC AGT AAC GGC AGA CTT CTC CT              24      68
105        CTTT GC AGT AAC GGC AGA CTT CTC CA            26      68
106        CTTTTT GC AGT AAC GGC AGA CTT CTC CC          28      66
107        G GAG AAG TCT GCC GTT ACT GCC                 22      70
109        C AGG AGT CAG GTG CAC CAT GGT                 22      70

Oligonucleotides containing the 3' nucleotide unique to each
allele were synthesized with different length 5' tails (see Table
I). Upon ligation to the invariant 32P radiolabelled adjacent
oligonucleotide, the individual products could be separated on a
polyacrylamide denaturing gel and detected by autoradiography.
Based upon these initial findings with autoradiography,
subsequent assays were performed using an automated,
non-radioactive detection scheme in which the allele specific
oligonucleotides were 5'-biotinylated for capture, and the
invariant oligonucleotides 3'-tailed with digoxigenin. The label
was then visualized in an ELISA format using anti-digoxigenin
conjugated to alkaline phosphatase, and a colorimetric substrate
for the enzyme.

Table I depicts the nucleotide sequence and corresponding
translated sequence of the oligonucleotides used in detecting βA
and βS globin genes. Oligonucleotides 101 and 104 detect the βA
target, while 102 and 105 detect the βS target when ligated to
labelled oligonucleotides 107 and 109, respectively.
Oligonucleotides 103 and 106 were designed to assay the efficiency
of ligation of G:T or G:A and C:A or C:T mismatches using βA or βS
globin gene targets respectively. Oligonucleotides were designed
with slightly different length tails to facilitate discrimination
of various products when separated on a denaturing polyacrylamide
gel. The tails, which were not complementary to the target
sequence, may be considered as being "reporter groups" for the
individual sequence. Consequently, ligation of oligonucleotides
101, 102, or 103 to 107 gives lengths of 45, 47, or 49
nucleotides, respectively. For the complementary strand, ligation
of oligonucleotides 104, 105, or 106 to 109 gives lengths of 46,
48, or 50 nucleotides, respectively. The oligonucleotides were
also designed to have calculated Tm values of 66 to 70°C, which is
just at or slightly above the ligation temperature.
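
The product lengths quoted above follow directly from the sizes listed in Table I. The short sketch below recomputes them; the sequences are transcribed from Table I (oligonucleotide 105 is taken to carry the same AAC codon as 104 and 106), and the dictionary and function names are hypothetical helpers introduced only for illustration.

```python
# Sequences as printed in Table I; spaces only mark tail and codon groups.
TABLE_I = {
    101: "GT C ATG GTG CAC CTG ACT CCT GA",
    102: "GTTT C ATG GTG CAC CTG ACT CCT GT",
    103: "GTTTTT C ATG GTG CAC CTG ACT CCT GG",
    104: "CT GC AGT AAC GGC AGA CTT CTC CT",
    105: "CTTT GC AGT AAC GGC AGA CTT CTC CA",
    106: "CTTTTT GC AGT AAC GGC AGA CTT CTC CC",
    107: "G GAG AAG TCT GCC GTT ACT GCC",
    109: "C AGG AGT CAG GTG CAC CAT GGT",
}


def oligo_length(number: int) -> int:
    """Length in nucleotides of a Table I oligonucleotide (spaces ignored)."""
    return len(TABLE_I[number].replace(" ", ""))


def ligated_length(diagnostic: int, invariant: int) -> int:
    """Length of the product formed by joining two adjacent oligonucleotides."""
    return oligo_length(diagnostic) + oligo_length(invariant)


if __name__ == "__main__":
    print([ligated_length(n, 107) for n in (101, 102, 103)])
    print([ligated_length(n, 109) for n in (104, 105, 106)])
```

Because each tail differs by two nucleotides, every ligation product differs in length from its neighbours and can be resolved on the denaturing gel.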

In order to detect the ligation products, oligonucleotides
107 and 109 were 5'-end labelled with 32P using T4
polynucleotide kinase and γ-32P according to the following
example.






EXAMPLE VIII
(radioactive labelling)
Oligonucleotide 107 (0.1 µg) was 5' end labelled in 20 µl 30
mM Tris-HCl buffer at pH 8.0 containing 20 mM Tricine, 10 mM
MgCl2, 0.5 mM EDTA, 5 mM dithiothreitol, and 400 µCi of
[γ-32P]ATP, by the addition of 15 units of T4 polynucleotide kinase.
After incubation at 37°C for 45 min, unlabelled ATP was added to
1 mM, and incubation was continued for an additional 2 min at 37°C.
The reaction was terminated by the addition of 0.5 µl 0.5 M EDTA,
and the kinase was heat inactivated at 65°C for 10 min.
Unincorporated 32P label was removed by chromatography with
Sephadex G-25 pre-equilibrated with TE buffer. Specific activity
ranged from 7 x 10^8 to 10 x 10^8 cpm/µg of oligonucleotide.

The specificity of the T. aquaticus thermophilic ligase
according to the present invention for complementary vs.
mismatched target was compared under both LDR and LCR
conditions (see figure 3 and the following Table II). In the LDR
series, two adjacent oligonucleotides were incubated with
denatured target DNA and ligase, where the last nucleotide of the
unlabelled oligonucleotide either complemented or mismatched the
target DNA. The oligonucleotides were designed with slightly
different length tails to facilitate discrimination of various
products by allowing them to be separated on a denaturing gel.
Consequently, as disclosed earlier, ligation of oligonucleotide
101 (βA allele), 102 (βS allele), or 103 to labelled 107 gives
lengths of 45, 47 or 49 nucleotides, respectively. For the
complementary strand, ligation of oligonucleotides 104 (βA
allele), 105 (βS allele), or 106 to labelled 109 gives lengths of
46, 48 or 50 nucleotides, respectively. The oligonucleotides were
also designed to have calculated Tm values of 66°C to 70°C, that
is, just at or slightly above the ligation temperature. Thus, the
specificity of ligating two oligonucleotides hybridized to target
DNA with perfect complementarity (A:T) could be directly compared
to each possible mismatch (A:A, T:T, G:A, G:T, C:A, or C:T). The
specificity of ligation of these oligonucleotides in the presence
of βA or βS globin gene target was determined as in the following
example:






EXAMPLE IX

(determination of specificity of thermophilic ligase)
Labelled oligonucleotide (200,000 cpm; 0.28 ng; 40 fmoles)
and unlabelled oligonucleotide (0.27 ng; 40 fmoles) were incubated
in the presence of target DNA (1 fmole = 6 x 10^8 molecules Taq I
digested βA or βS globin plasmid) in 10 µl 20 mM Tris-HCl buffer
at pH 7.6 and containing 100 mM KCl, 10 mM MgCl2, 1 mM EDTA, 10
mM NAD, 10 mM dithiothreitol, 4 µg salmon sperm DNA, and 15
nick-closing units of the thermophilic ligase, and overlaid with a
drop of mineral oil. The reactions were incubated at 94°C for 1
min followed by 65°C for 4 min, and this cycle was repeated
between 5 and 30 times. The reactions were terminated by the
addition of 8 µl formamide containing EDTA (10 mM), xylene
cyanol (0.2%), and bromphenol blue (0.2%). Samples (4 µl) were
denatured by boiling for 3 min prior to loading (40,000 cpm/lane)
into the gel.
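
The molar amounts and molecule counts quoted in this example are related through the Avogadro constant. The following sketch, with a hypothetical helper name, shows the conversion and reproduces the rounded figures used in the text (1 fmole is about 6 x 10^8 molecules, 10 attomoles about 6 x 10^6).

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def moles_to_molecules(moles: float) -> float:
    """Convert an amount in moles to a number of molecules."""
    return moles * AVOGADRO


if __name__ == "__main__":
    print(f"1 fmole  = {moles_to_molecules(1e-15):.2e} molecules")
    print(f"10 amole = {moles_to_molecules(10e-18):.2e} molecules")
    print(f"40 fmole = {moles_to_molecules(40e-15):.2e} molecules")
```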

Products were separated by electrophoresis in which
samples were loaded in groups of eight, run into the gel, and then
the next set loaded, thereby accounting for the slightly slower
mobility of the bands on the right side of the autoradiogram of
figure 3. Electrophoresis was in a 10% polyacrylamide gel
containing 7 M urea in a buffer of 100 mM Tris borate pH 8.9 and 1
mM EDTA, for 2 hrs at 60 W constant power.

After removing the urea by soaking for 10 min in 10% acetic
acid followed by a second soak of 5 min in water, the gels were
dried onto Whatman 3 mm paper and autoradiographed overnight at
-70°C on Kodak XAR-5 film (with or without Du Pont Cronex
Lightning Plus intensifying screen). Bands from 20 cycles were
excised from the gels and assayed for radioactivity. The results
are given in Table II.

TABLE II

Quantitation of complementary and mismatched LDR and LCR
bands. Bands from the 20 cycle LDR and 30 cycle LCR experiments
described in Example IX and depicted in Fig. 3 were excised from
gels and assayed for radioactivity. Percentage product formed =
cpm in product band/cpm in starting oligonucleotide band.
Percentage mismatched/complementary = cpm in band of mismatched
oligonucleotides/cpm in band of complementary oligonucleotide
using the same target DNA, and gives an indication of the noise to
signal ratio. LDR amplification was performed using 6 x 10^8
target molecules or 1 femtomole; LCR amplification was
performed using 6 x 10^6 target molecules or 10 attomoles.

        Oligo base:      Product        mismatched/
        target base      formed (%)     complementary (%)

LDR     A:T              21.5
        T:A              13.2
        T:A              17.9
        A:T              12.4
        A:A              <0.1           <0.4
        T:T              0.12           0.7
        T:T              0.16           1.0
        A:A              <0.1           <0.4
        G:T              0.30           1.4
        C:T              <0.1           <0.4
        G:A              <0.1           <0.4
        C:A              <0.1           <0.4

LCR     A:T, T:A         41.4
        T:A, A:T         10.4
        A:A, T:T         0.45           1.1
        T:T, A:A         <0.05          <0.2
        G:T, C:A         0.51           1.3
        G:A, C:T         <0.05          <0.2




Thus, the thermophilic T. aquaticus ligase was shown
to discriminate complementary from mismatched oligonucleotide
sequences for all possible mismatched base pairs in LDR assays.
Under both competition and individual ligation experiments (at
varying salt concentrations), the worst case mismatch ligations
were 1.5 to 1.0% (see Table II, G:T and T:T), while others were
0.4% to <0.1% (see Table II, A:A, C:T, G:A and C:A) of the products
formed with complementary base pairs (A:T). This is
substantially better than reported (using radioactive detection)
for the mesophilic T4 ligase of E. coli [see Gene 76:245 (1989)].

In the LCR amplification/detection series of experiments,
two adjacent oligonucleotides were incubated with denatured
target DNA and ligase, as well as with the complementary set of
oligonucleotides. Under these conditions, the 3' nucleotide of the
unlabelled diagnostic oligonucleotide either complemented or
mismatched the target DNA, but always complemented its
unlabelled counterpart, i.e. A:T for 101 and 104, T:A for 102 and
105, and G:C for 103 and 106. Thus, an initial "incorrect" ligation
of a mismatched oligonucleotide would subsequently be amplified
with the same efficiency as a correct ligation. Samples
contained pairs of unlabelled oligonucleotides (βA allele specific
101 and 104, βS allele specific 102 and 105, or 103 and 106)
with the complementary and adjacent pairs of labelled
oligonucleotides, 107 and 109. These labelled and unlabelled
oligonucleotides were incubated in the presence of ligase and 10
attomoles of target DNA (100 fold less target DNA than for LDR)
for 20 or 30 cycles as in Example IX. The resulting bands are
depicted in the left portion of figure 3 and the lower half of Table
II.

As can be seen in figure 3 and Table II, the thermophilic
ligase according to the present invention was capable of
discriminating complementary from mismatched oligonucleotide
sequences for all possible mismatched base pairs in LCR assays.
Under both competition and individual ligation experiments the
worst case mismatch ligations were from 1.3% to 0.6% (G:T, C:A
and A:A, T:T), while others were <0.2% (T:T, A:A and G:A, C:T) of
the products formed with complementary base pairs (A:T, T:A).
LCR, using thermophilic ligase according to the present invention,
is thus the only method which can both amplify and detect single
base mismatches with high signal to noise ratios [see Genomics
4:560 (1989)]. Thus, by utilizing LCR one can detect the
difference between a single base mismatch such as occurs
between βA and βS, and use the results of this assay as a
diagnostic for the normal, the carrier, or the diseased patient.
When the entire set of experiments described above was
repeated using buffer containing 150 mM instead of 100 mM KCl,
the results were essentially the same as in figure 3 and tabulated
in Table II, with ligation of mismatch oligonucleotides for LDR
ranging from 0.6% to <0.3% and for LCR ranging from 1.7% to <0.3%
of the exactly complementary products. Thus, the exquisite
discrimination between matched and mismatched oligonucleotides
appears not to be critically dependent upon salt conditions.

Alternatively, a different procedure based on phosphatase
may also be used. The LCR or LDR reaction may be performed in a
10 µl volume under mineral oil. To this is added 50 µl of 10 mM
Tris-HCl pH 7.6 containing 0.5 units of Bacterial Alkaline
Phosphatase (BAP), and 10 mM MgCl2, and the incubation
continued at 65°C for 2 hrs (note that the ligase enzyme is not
killed under these conditions). The 5' end label on an
oligonucleotide that has become covalently linked is no longer
susceptible to BAP. Ligated product is separated from
monophosphate by the addition of 20 µl of 10 mg/ml sonicated
salmon sperm DNA as a carrier and precipitated with 20 µl of
50% TCA. After centrifugation for 5 min at 12,000 rpm, the
supernatant is removed, and the ratio of pellet to pellet +
supernatant gives the percentage of product formed. A similar
assay has been used with Taq I endonuclease, and the
experimental error for positive and negative controls is around
1-2%.
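
The percentage of product formed in the phosphatase assay is a simple ratio of counts. A minimal sketch of that calculation is given below; the function name and the example counts are hypothetical and serve only to illustrate the ratio of pellet to pellet plus supernatant.

```python
def percent_ligated(pellet_cpm: float, supernatant_cpm: float) -> float:
    """Percentage of label protected from BAP (pellet) out of total label."""
    total = pellet_cpm + supernatant_cpm
    if total <= 0:
        raise ValueError("total counts must be positive")
    return 100.0 * pellet_cpm / total


if __name__ == "__main__":
    # Hypothetical counts: 2,500 cpm in the TCA pellet, 7,500 cpm in the supernatant
    print(percent_ligated(2500, 7500))
```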

Use of the thermophilic ligase according to the present
invention obviates the need to carefully titrate both salt and
enzyme concentration as required for mesophilic ligases. The
data from this series of experiments are tabulated in the following
Table III.

TABLE III

Quantitation of complementary and mismatched LDR and LCR
bands, at 100 and 150 mM KCl concentrations, from 20 cycle LDR
and 30 cycle LCR experiments described in Example IX and
depicted in Fig. 3. LDR amplification was performed using 6 x 10^8
target molecules or 1 femtomole; LCR amplification was
performed using 6 x 10^6 target molecules or 10 attomoles. The
mismatched/complementary gives an indication of the noise to
signal ratio.

        Oligo base:      Product formed (%)     mismatched/complementary (%)
        target base      [KCl] (mM)             [KCl] (mM)
                         100        150         100        150

LDR     A:T              21.5       23.2
        T:A              13.2       17.2
        T:A              17.9       12.8
        A:T              12.4       11.7
        A:A              <0.1       <0.2        <0.4       <0.3
        T:T              0.12       0.21        0.7        0.3
        T:T              0.16       0.30        1.0        0.6
        A:A              <0.1       <0.2        <0.4       <0.3
        G:T              0.30       0.25        1.4        0.4
        C:T              <0.1       <0.2        <0.4       <0.3
        G:A              <0.1       0.25        <0.4       0.4
        C:A              <0.1       0.20        <0.4       0.3

LCR     A:T, T:A         41.4       14.2
        T:A, A:T         10.4       18.5
        A:A, T:T         0.45       0.09        1.1        0.6
        T:T, A:A         <0.05      <0.05       <0.2       0.3
        G:T, C:A         0.51       0.24        1.3        1.7
        G:A, C:T         <0.05      <0.1        <0.2       <0.7

LCR and LDR specificity was tested using both βA and βS
specific oligonucleotides in direct competition for ligation to the
invariant labelled oligonucleotides. Using target DNA (βA, βS,
and an equimolar ratio of βA and βS) ranging from 1 femtomole
to 1 attomole, thermophilic ligase specifically formed the
correct product(s) in each case; no background incorrect ligation
product was observed when only one target allele was present.
However, the efficiency of forming the βS specific products is
somewhat less than forming the βA products, and after 20 cycles
of amplification, the βS specific products were approximately
one-third of the βA specific products as quantitated by assaying
excised products for radioactivity. Hence a direct competition
assay, wherein two oligonucleotides are differentially labelled
(for example with fluorescent groups) to quantitate the relative
initial concentrations of each target sequence allele, will require
careful titrations for each allele.

The specificity of LCR DNA amplification with sub-attomole
quantities of target DNA was also examined. The extent of LCR
DNA amplification was determined in the presence of target DNA
ranging from 100 attomoles (6 x 10^7 molecules) to less than one
molecule per tube. Reactions were incubated for 20 or 30 cycles,
and products separated and quantitated as depicted in figure 4 and
the following Table IV.

TABLE IV

Quantitation of LCR amplification. Bands from 30 cycle LCR
experiments were excised from the gels and assayed for
radioactivity. At higher target concentration, DNA amplification
was essentially complete after 20 cycles; slightly imprecise
excision of 30 cycle bands from this portion of the gel probably
accounts for product formed values in excess of 100%.
Percentage product formed = cpm in product band/cpm in starting
oligonucleotide band; Amplification = No. of product molecules
formed/No. of target molecules.

Target          Product formed     Amplification
molecules       (%)

6 x 10^7        134
2 x 10^7        96
6 x 10^6        107
2 x 10^6        78
6 x 10^5        85
2 x 10^5        48                 5.8 x 10^4
6 x 10^4        25                 1.0 x 10^5
2 x 10^4        4.5                5.4 x 10^4
6 x 10^3        2.3                9.2 x 10^4
2 x 10^3        0.36               4.3 x 10^4
6 x 10^2        0.18               7.2 x 10^4
2 x 10^2        0.14               1.7 x 10^5
60              <0.05
20              <0.05
6               <0.05
2               <0.05
0               <0.05



In the absence of target, no background signal was detected
when carrier salmon sperm DNA (4 µg) was present as seen in
figure 4. At higher initial target concentrations, DNA
amplification was essentially complete after 20 cycles, while at
lower initial target concentrations substantially more product is
formed with additional amplification cycles. Under these
conditions, 200 molecules of initial target DNA could easily be
detected after 30 cycles.
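
The amplification values of Table IV can be recomputed from the percentage of product formed. The sketch below assumes that the percentage refers to the 40 fmoles of labelled oligonucleotide present in each reaction (as in Example IX); the constant and function names are hypothetical.

```python
AVOGADRO = 6.02214076e23        # molecules per mole
LABELLED_OLIGO_MOLES = 40e-15   # assumed: 40 fmoles labelled oligonucleotide per tube


def amplification(product_percent: float, target_molecules: float) -> float:
    """Product molecules formed per initial target molecule."""
    product_molecules = product_percent / 100.0 * LABELLED_OLIGO_MOLES * AVOGADRO
    return product_molecules / target_molecules


if __name__ == "__main__":
    for pct, target in ((48, 2e5), (25, 6e4), (0.14, 2e2)):
        print(f"{target:.0e} targets, {pct}% product: {amplification(pct, target):.1e}")
```

With these assumptions the 2 x 10^5, 6 x 10^4 and 2 x 10^2 target rows give roughly 5.8 x 10^4, 1.0 x 10^5 and 1.7 x 10^5, in line with the tabulated values.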

The thermostable nature of the enzyme is readily apparent
in figure 4. By comparing the amount of product formed after 20
cycles to that formed after 30 cycles, it is apparent that at the
lower target DNA concentrations additional product is formed
after more cycles (see especially 2 x 10^4 to 2 x 10^2 target DNA
molecules). In other words, the enzyme still has activity after 20
cycles of 94°C for 1 minute followed by 65°C for 4 minutes.

Thus, T. aquaticus ligase retains the ability to catalyze
formation of a phosphodiester bond between two adjacent
oligonucleotides hybridized to a complementary strand of DNA at
a temperature in the range of about 50°C to about 85°C after
repeated exposure to temperatures that denature DNA, namely in
the range of about 90°C to about 105°C for about 0.25 minutes to
about 4 minutes.

Hence, the specific amplification of a nucleic acid test
substance of known nucleotide sequence using LCR requires: (1)
two adjacent oligonucleotides complementary to and in molar
excess of the target sequence nucleic acid, and having no
mismatch to the target sequence nucleic acid at the junction of
the adjacent oligonucleotides; (2) a second set of adjacent
oligonucleotides complementary to the first set of adjacent
oligonucleotides, complementary to and in molar excess of the
target sequence nucleic acid, and having no mismatch to the
target sequence nucleic acid at the junction of this second set of
adjacent oligonucleotides; (3) a thermostable ligase which does
not become irreversibly denatured and lose its catalytic ability
when subjected to temperatures of from about 50°C to about
105°C; and (4) subjecting this ligase mixture to repeated
temperature cycles which comprise a first temperature to
denature the DNA (in a range of about 90°C to about 105°C), and a
second temperature to allow for hybridization/ligation (in a range
of about 50°C to about 85°C). In the amplification of the βA globin
allele described above, the components were (1) oligonucleotides
101 and 107; (2) oligonucleotides 104 and 109; (3) T. aquaticus
ligase; and (4) 30 temperature cycles of 94°C for 1 minute
followed by 65°C for 4 minutes.

In figure 4, bands of 45 and 46 nucleotides correspond to
ligation products of the coding and complementary βA globin
oligonucleotides. Lower molecular weight products correspond to
ligation of deletion oligonucleotides present in the initial
ligation reaction. Since samples were loaded in groups of eight,
the right side of the autoradiogram gives the appearance of
slower migration.

To further test the ability of ligase to discriminate
between complementary and mismatched oligonucleotides, an LCR
experiment was performed in the presence and absence of
oligonucleotides which would give G-T and C-A mismatches in
accordance with the following example, which not only shows DNA
amplification, but also supports the thermostable nature of the
enzyme found in Example IX.

EXAMPLE X

One set of experiments contained 40 fmoles each of
unlabelled 101 and 104 oligonucleotides, while the second set had
in addition 40 fmoles of unlabelled 103 and 106 oligonucleotides.
Both sets contained 40 fmoles each of labelled 107 and 109.
Labelled oligonucleotides (200,000 cpm; 0.28 ng; 40 fmoles) and
unlabelled oligonucleotides (0.27 ng; 40 fmoles) were incubated in
the presence of target DNA, ranging from 100 attomoles (6 x 10^7
molecules) to 0.01 attomoles (6 x 10^3 molecules) of Taq I
digested βA or βS globin plasmid. Incubation was carried out in
10 µl 20 mM Tris-HCl, pH 7.6 buffer containing 100 mM MgCl2, 1
mM EDTA, 10 mM NAD, 10 mM dithiothreitol, 4 µg salmon sperm
DNA, and 15 nick-closing units of T. aquaticus ligase, and
overlaid with a drop of mineral oil. Reactions were incubated at
94°C for 1 min followed by 65°C for 4 min, and this cycle was
repeated 20 or 30 times.

The resulting samples were electrophoresed, and the gel was
autoradiographed overnight with the aid of a Cronex intensifying
screen and the bands counted. The bands from the
autoradiographed gel are depicted in figure 4, and the quantitation
of LCR amplification tabulated in the following Table V.

TABLE V

Quantitation of LCR amplification in the presence or absence of
mismatched competitor molecules.

                   Complementary            Complementary & Mismatched
                   Oligonucleotides         Oligonucleotides
                   (101, 104)               (101, 104 & 103, 106)
                   (A:T, T:A)               (A:T, T:A & G:T, C:A)

Target             Product    Ampli-        Product    Ampli-        Mismatched/
molecules          formed     fication      formed     fication      Complementary

6 x 10^7 (βA)      114                      93                       1.0
2 x 10^7           93                       95                       1.8
6 x 10^6           102                      93                       0.5
2 x 10^6           90                       67                       0.5
6 x 10^5           51                       46
2 x 10^5           31         3.7 x 10^4    23         2.8 x 10^4
6 x 10^4           17         6.8 x 10^4    9.3        3.7 x 10^4
2 x 10^4           8.6        1.0 x 10^5    2.9        3.5 x 10^4
6 x 10^3           3.2        1.3 x 10^5    0.8        3.4 x 10^4
0                  <0.1                     <0.1
6 x 10^7 (βS)      2.1                      1.5



At high target concentrations, sufficient mismatched
product was produced to be visualized (as in figure 4), the
quantity of mismatched product ranging from 1.8% to 0.5% of the
complementary product. Use of an excess of mismatched target
DNA (βS instead of βA globin DNA at 6 x 10^7 molecules per tube)
gave only 2.1% and 1.5% product. The same amount of product may
be formed when using three to ten thousand fold less
complementary target DNA. Based upon this, the signal from
correctly paired ligation products is 50 to 500 fold higher than
mismatched products under competition or individual LCR ligation
conditions.

At low target concentrations, the extent of DNA
amplification ranged from 3.7 x 10^4 to 1.7 x 10^5 (see Tables IV
and V). Assuming the efficiency of ligation is the same in each
cycle, the average amplification per cycle is between 40 and 50%.
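
The per-cycle figure follows from the overall amplification if each of the 30 cycles is assumed to multiply the number of templates by the same factor (1 + e). The sketch below, with a hypothetical function name, solves for e.

```python
def per_cycle_efficiency(total_amplification: float, cycles: int) -> float:
    """Solve (1 + e) ** cycles = total_amplification for the per-cycle gain e."""
    return total_amplification ** (1.0 / cycles) - 1.0


if __name__ == "__main__":
    for amp in (3.7e4, 1.7e5):
        print(f"{amp:.1e} over 30 cycles: {100 * per_cycle_efficiency(amp, 30):.0f}% per cycle")
```

For amplifications of 3.7 x 10^4 and 1.7 x 10^5 over 30 cycles this gives about 42% and 49% per cycle, consistent with the stated range.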

The efficiency per cycle could, of course, be potentially
enhanced by altering buffer conditions, enzyme concentration, or
thermal cycling times and temperatures, all within the
capabilities of those skilled in the art. It has, for example, been
shown that the ligation efficiency of thermophilic ligase (and
other ligases) may be enhanced by altering buffer compositions,
such as using NH4Cl, HEPES, polyamines such as spermidine, or
polyethylene glycols [see J. Biol. Chem 259:10041 (1984), and J.
Biochem. 100:123 (1986)]. Varying the amounts of each
component in the currently used buffer and either supplementing
or exchanging one or more components with, but not limited to,
the chemical and biological components listed above, are among
the methods of improving LCR that are straightforward for those
skilled in the art. One skilled in the art can also easily vary the
cycling times and temperatures. For example, at later time
points, the majority of target present is oligonucleotide product
from a previous LCR reaction. These oligonucleotides are short
(preferably but not limited to 40-60 mers) and may melt more
rapidly, allowing more rapid cycling. In the present invention,
successful ligase chain reactions have been completed for 30 and






40 cycles under cycling conditions of 94°C for 0.5 minutes
followed by 65°C for 2 minutes (half the time of the 1 minute at
94°C and 4 minutes at 65°C cycle time for the preferred ligase
chain reaction conditions). Both the ligation temperature and the
DNA denaturing temperatures may be varied with respect to
actual degree, duration, and number of repeated cycles. Optimal
conditions must maximize the amount of product formed in the
presence of perfectly complementary target DNA, while
minimizing the amount of incorrect product formed in the
presence of mismatched target DNA or in the absence of
complementary target DNA.

Utilizing these findings, a method for the detection of
specific sequences of oligonucleotides in clinical samples was
developed. The source of the sample may be any material or
substance which comprises nucleic acid. The nucleic acid need
not be a naturally occurring nucleic acid, but may be synthesized
by chemical, enzymatic, or biological means and may have other
than naturally occurring purines and pyrimidines. The source of
the clinical sample may be cellular or non-cellular, and may be
derived from such physiological media as blood, serum, plasma,
breast milk, stool, pus, tissue scrapings, washings, urine, or the
like. Furthermore, the sample may be associated with a set or
subset of cells, such as neoplastic cells, lymphocytes (for
example, T-cells or B-cells), monocytes, neutrophils, etc.; may
include pathogens including viruses, bacteria, mycoplasma, fungi,
protozoa, etc.; may include constructs, etc. or RNA, such as
messenger RNA, transfer RNA, ribosomal RNA, viruses, or the like;
and it may involve structural genes, untranslated regions,
regulatory regions, introns, exons, or the like. In addition, the
detection may be for a wide variety of purposes such as, for
example, the diagnosis of a potential or actual disease state in
plant or animal species, as well as the detection of sets or
subsets of pathogens, the monitoring of genetic engineering, or
the like.

One such method for which the present invention may be
used (and which clearly demonstrates the feasibility of direct
LCR allelic detection from blood samples without the need for
prior PCR amplification) is embodied, for example, in the
detection of β-globin alleles in human genomic DNA. Based upon
the high level of DNA amplification, the allele specific LCR
detection of DNA was examined from blood collected from normal
(βAβA), carrier (βAβS), and sickle cell (βSβS) individuals as
more fully described in the following example:

EXAMPLE XI

(detection of β-globin alleles in human genomic DNA)
Human genomic DNA was isolated from 0.5 ml whole blood
[see PCR Technology, H. A. Erlich editor, Stockton Press (1989) pg
36]. Whole blood (0.5 ml) was mixed with an equal volume of
lysis buffer (10 mM Tris-HCl, pH 7.6, containing 5 mM MgCl2 and
0.32 M sucrose). After a brief centrifugation (1 min at 12,000
rpm in an Eppendorf desktop centrifuge), the supernatant was very
carefully removed, leaving 0.15 to 0.2 ml of supernatant and
loosely pelleted nuclei. The pellet was resuspended with
vortexing in an additional 0.5 ml lysis buffer, nuclei pelleted and
the supernatant removed as above. This step was repeated three
or four times until the supernatant was clear or just barely pink.
After removal of the final supernatant (again leaving about 0.15
to 0.2 ml), 0.25 ml of LCR DNA Buffer containing non-ionic
detergents (20 mM Tris-HCl, pH 7.6, containing 2 mM EDTA and
0.45% each of non-ionic detergents NP40 and Tween 20) was
added. Any excess RNA was digested by the addition of 2 µl of 4
mg/ml heat treated RNase A for 15 min at 37°C. Any proteins
were digested by the addition of 5 µl of 10 mg/ml freshly made
Proteinase K and incubation at 50°C for 1 to 2 hours. Proteinase
K and RNase A were removed by sequential extractions with
phenol, phenol/chloroform, chloroform, n-butanol (2X) and the
nucleic acid recovered by precipitation with ethanol. Samples
were boiled for 5 min prior to use in LCR assays.

Each isolated human genomic DNA was tested in two
reaction mixtures, the first testing for the presence of the
normal βA allele, and the second testing for the presence of the
sickle βS allele. The first reaction mixture contained βA test
oligonucleotides 101 and 104 (0.27 ng or 40 fmoles each),
labelled oligonucleotides 107 and 109 (200,000 cpm or 0.28 ng or
40 fmoles each), and genomic DNA (corresponding to 10 µl of blood,
or about 6 x 10^4 nucleated cells) in 10 µl 20 mM Tris-HCl buffer,
pH 7.6, containing 100 mM KCl, 10 mM MgCl2, 1 mM EDTA, 10 mM NAD,
10 mM dithiothreitol, and 15 nick-closing units of T. aquaticus
ligase, and overlaid with a drop of mineral oil. The second
reaction mixture contained βS test oligonucleotides 102 and 105
(0.27 ng or 40 fmoles each), labelled oligonucleotides 107 and
109 (200,000 cpm or 0.28 ng or 40 fmoles each), and genomic DNA
(corresponding to 10 µl of blood, or about 6 x 10^4 nucleated cells)
in 10 µl 20 mM Tris-HCl buffer, pH 7.6, containing 100 mM
KCl, 10 mM MgCl2, 1 mM EDTA, 10 mM NAD, 10 mM dithiothreitol,
and 15 nick-closing units of T. aquaticus ligase, and overlaid
with a drop of mineral oil.

Both reaction mixtures were incubated at 94°C for 1 min
followed by 65°C for 4 min, and this cycle was repeated 20 to 30
times. Reactions were terminated by the addition of 8 µl
formamide containing EDTA (10 mM), xylene cyanol (0.2%), and
bromphenol blue (0.2%).

Samples (4 µl) were denatured by boiling for three min prior
to loading (40,000 cpm/lane). Electrophoresis was in a 10%
polyacrylamide gel containing 7 M urea in a buffer of 100 mM Tris
borate at pH 8.9 and 1 mM EDTA, for 2 hours at 60 Watt constant
power. After removing the urea (10 min soak in 10% acetic acid,
followed by 5 min soak in H2O), gels were dried onto
Whatman 3 mm paper and autoradiographed overnight at -70°C on
Kodak XAR-5 film with a DuPont Cronex intensifying screen.
Ligation products of 45 and 46, or 47 and 48 nucleotides indicate
the presence of the βA or βS globin gene, respectively. As noted
with plasmid derived target DNA, the efficiency of ligation (and
hence detection) is somewhat less for the βS than the βA
specific oligonucleotides.
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Figure 5 is an autoradiogram showing the detection of 13- 
globin alleles in human genomic DNA made in accordance with the 
proceeding example. Ligation products of 45 and 46, or 47 and 48 
nucleotides indicate the presence of the B A or B s globin gene, 
5 respectively. Thus, with target DNA corresponding to 10 jxl of 
blood, B A and BS alleles could be readily detected using allele 
specific LCR. 
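
By way of illustration only, the gel readout described above can be sketched in Python. The names PRODUCT_LENGTHS and alleles_from_bands are hypothetical, and treating any one band of a pair as sufficient evidence for an allele is an assumption of this sketch rather than a rule stated in the example.

```python
# Ligation product lengths (in nucleotides) reported in the text for each allele.
# The allele labels are plain-text stand-ins for the beta-A and beta-S globin genes.
PRODUCT_LENGTHS = {
    "beta-A": {45, 46},
    "beta-S": {47, 48},
}


def alleles_from_bands(band_lengths):
    """Return the sorted alleles whose ligation products appear among the bands.

    Assumption of this sketch: a single band from a pair (for example 45
    without 46) is treated as sufficient to report the allele.
    """
    observed = set(band_lengths)
    return sorted(
        allele
        for allele, lengths in PRODUCT_LENGTHS.items()
        if observed & lengths
    )


if __name__ == "__main__":
    print(alleles_from_bands([45, 46]))
    print(alleles_from_bands([45, 46, 47, 48]))
```

Under these assumptions, a lane showing only the 45 and 46 nucleotide bands is reported as carrying β^A alone, while a lane showing all four bands is reported as carrying both alleles.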

Hence, the successful detection of a biologically derived
nucleic acid test substance, which has a known normal nucleotide
sequence and a known possible mutation at at least one target
nucleotide position in the sequence, requires (1) a first reaction
mixture comprising two sets of adjacent oligonucleotides
complementary to each other, complementary to the target
sequence nucleic acid, wherein there is at least one mismatched
base pair to the mutant target sequence nucleic acid, but not to
the normal target sequence nucleic acid at the junction of the
adjacent oligonucleotides; (2) a second reaction mixture
comprising two sets of adjacent oligonucleotides complementary
to each other, complementary to the target sequence nucleic acid,
wherein there is at least one mismatched base pair to the normal
target sequence DNA, but not to the mutant target sequence
nucleic acid at the junction of the adjacent oligonucleotides; (3) a
thermostable ligase which does not become irreversibly
denatured and lose its catalytic ability when subjected to
temperatures of from about 50°C to about 105°C; (4)
subjecting these ligase mixtures to a repeated temperature cycle
which comprises a first temperature to denature the DNA (in a
range of about 90°C to about 105°C), and a second temperature to
allow for hybridization/ligation (in the range of about 50°C to
about 85°C), which also allows adjacent oligonucleotides in each
reaction mixture to become possibly covalently linked; (5)
separating the test substance and any unlinked test
oligonucleotides from covalently linked oligonucleotide product
(if formed); and (6) detecting the presence or absence of
covalently linked oligonucleotides in each reaction mixture,
whereby the presence of covalently linked oligonucleotide product
in the first reaction mixture indicates the presence of normal
target sequence and the presence of covalently linked
oligonucleotide product in the second reaction mixture indicates
the presence of mutant target sequence. In the detection of β^A
and β^S globin alleles described above, the components were (1)
oligonucleotides 101, 104, 107 and 109; (2) oligonucleotides 102,
105, 107 and 109; (3) T. aquaticus ligase; (4) 30 temperature
cycles of 94°C for 1 min followed by 65°C for 4 min; (5)
denaturing nucleic acids by boiling in 45% formamide and
separating on a sequencing gel; and (6) autoradiographing of the
gel.

This clearly demonstrates the feasibility of direct LCR
allelic detection from blood samples according to the present
invention without the need for PCR amplification.

As noted with plasmid derived target DNA, the efficiency of
ligation (and hence detection) is somewhat less for the β^S than
the β^A specific oligonucleotides. After 30 cycles of
amplification, β^S specific products were approximately one-third
of β^A specific products, as quantitated by assaying excised
products for radioactivity. These differences may be a function
of the exact nucleotide sequence at the ligation junction, or the
particular oligonucleotides (with differing 5' tails) used in the
LCR experiments. However, the present invention still allows for
a direct competition assay where two oligonucleotides are
differentially labelled (for example with fluorescent groups or, in
this case, with different length tails) to determine the presence
or absence of either allele in a reaction mixture. In the
generalized form, the method according to the present invention
allows one to assay two alleles in the same vessel, provided the
sets of oligonucleotides containing at least one mismatched base
pair to the mutant target sequence nucleic acid, but not to the
normal target sequence nucleic acid at the junction of the
adjacent oligonucleotides, are labelled with one set of labels, and
the oligonucleotides containing at least one mismatched base pair
to the normal target sequence nucleic acid, but not to the mutant
target sequence nucleic acid at the junction of the adjacent
oligonucleotides, are labelled with a different label.

In a comparable non-radioactive assay, as depicted in Figure
6, a minimum of two oligonucleotide probes are synthesized and
modified for particular functions in the ligation assay. One probe
contains a hook that permits the capture of the oligonucleotide
following ligation. An example of such a hook is biotin, which can
be captured by streptavidin or avidin bound to appropriate
supports. The other probe has a reporter group. Although a
variety of reporter groups, both radioisotopic and non-radioactive
(such as fluorophores or luminescent moieties), are available and
can be used with the assay according to the present invention,
the currently preferred reporter is one which may participate in
an ELISA (enzyme-linked immunosorbent assay). More specifically,
Figure 6 depicts a schematic diagram of an ELISA based
oligonucleotide ligation assay in which
biotinylated (B) and digoxigenin-labelled (D) oligonucleotides are
hybridized with a DNA target in the presence of ligase (arrow).
Biotinylated oligonucleotides are captured on streptavidin (SA)
coated within the wells of microtiter plates. The wells are
washed to remove unbound oligonucleotides, and alkaline
phosphatase (AP) conjugated anti-digoxigenin antibodies (αD) are
added to the wells. Following an incubation and wash cycle,
alkaline phosphatase substrate (S) is added, and digoxigenin is
detected by the production of a colored product.

The non-radiolabelled assay according to the present
invention consists of several steps: (1) preparation of the DNA
target; (2) denaturation and hybridization of the modified
oligonucleotide probes; (3) ligation; (4) capture of the
biotinylated probe; (5) washing to remove free nonbiotinylated
oligonucleotides and target; (6) addition of alkaline phosphatase
conjugated anti-digoxigenin antibodies; (7) washing to remove
unbound antibody; (8) addition of alkaline phosphatase substrate;
and (9) spectrophotometric analysis. The following flow chart
details the general procedure (which has been automated on a
modified Biomek 1000 workstation instrument) by which a non-
radiolabelled assay according to the present invention can be
conducted:

Amplified Target DNA



T4 Ligase Detection                         Taq Ligase Detection

Denature residual Taq polymerase by adding:

45 µl of 0.3 N NaOH                         45 µl of 0.1 N KOH

Renature target DNA by adding:

45 µl of 0.3 N HCl                          45 µl of 0.1 N HCl

Distribute amplified target to microtiter plates at 10 µl per well

Add biotinylated and reporter oligonucleotides to DNA targets
(200 fmoles of each oligonucleotide in 10 µl of 2 X ligation mix)

Ligation Mix:                               Ligation Mix:
200 fmole biotinylated oligo                200 fmole biotinylated oligo
200 fmole reporter oligo                    200 fmole reporter oligo
100 mM Tris-HCl, pH 7.5                     100 mM Tris-HCl, pH 7.5
20 mM MgCl2                                 20 mM MgCl2
10 mM DTT                                   10 mM DTT
2 mM ATP                                    2 mM ATP
2 mM Spermidine                             2 mM Spermidine
50% Formamide                               2 mM NAD
                                            100 mM KCl
                                            Taq Ligase

Denature target oligonucleotide mix at 93°C for 2 minutes

Cool to room temperature and add            Cool to 60-68°C and ligate
5 µl T4 ligase in                           for 15 minutes (repeat
200 mM NaCl                                 denaturation and ligation
50 mM Tris-HCl, pH 7.5                      step to amplify)
10 mM MgCl2
5 mM DTT
1 mM ATP
1 mM Spermidine

Ligate at room temperature (25°C)
for 15 minutes

Stop ligation reaction and denature products by adding 10 µl of 0.3 N NaOH

Neutralize reactions by adding 4 µl of 3 M sodium acetate

Transfer reactions to an avidin coated and blocked microtiter plate
(avidin coating - 60 µg of avidin/well in 60 µl of PBS, pH 7.0 for 60 min at 37°C;
blocking - remove avidin from the plate and add 200 µl/well of 100 mM Tris-HCl,
150 mM NaCl, 0.05% Tween, 0.5% dry milk, and 100 µg/ml of salmon sperm DNA)

Capture biotinylated oligonucleotide at room temperature for 30 minutes

Wash plate to remove unbound oligonucleotides and targets with (1) 100 mM
Tris-HCl, pH 7.5, in 150 mM NaCl in 0.05% Tween; (2) 0.01 N NaOH in 0.05% Tween;
and (3) 100 mM Tris-HCl, pH 7.5, in 150 mM NaCl in 0.05% Tween

Add alkaline phosphatase conjugated antibody to the reporter oligonucleotide; 30 µl per
well in 100 mM Tris-HCl, pH 7.5, 150 mM NaCl, 0.5% dry milk and 0.05% Tween

Incubate plates for 30 min at room temperature for antibody binding to the reporter

Wash the plate with 100 mM Tris-HCl, pH 7.5, 150 mM NaCl in 0.05% Tween to
remove unbound antibody

Add substrate

Read plate for appropriate colorimetric, chemiluminescent, or fluorescent product

Genomic sequences required to begin this assay can be
amplified by a number of different methods, including LCR, 3SR,
and PCR. We have used PCR amplification to obtain the DNA targets
listed in the following Table VI for ligation assay primers:

TABLE VI

(sequences of amplification primer sets)

Target Gene            Amplification Primers

β-globin               CAACTTCATCCACGTTCACCTTGCC
                       AGGGCAGGAGCCAGGGCTGGGG

alpha1-antitrypsin     TCAGCCTTACAACGTGTCTCTGCTT
                       GTATGGCCTCTAAAAACATGGCCCC

cystic fibrosis        CAGTGGAAGAATGGCATTCTGTT
                       GGCATGCTTTGATGACGCTTCTG



DNA amplification was performed by mixing 5 µl of DNA (2
ng/µl for genomic DNA, or 5 µl of treated material from an
alternative source) with a pair of primer
oligonucleotides (0.5 µM each) specific for the region of DNA to
be amplified, in a PCR buffer containing 0.05 U/µl of Taq
polymerase, 50 mM KCl, 25 mM Tris-HCl buffer at pH 8.3, 10 mM
MgCl2, 200 µg/ml gelatin, 0.1% Triton X-100, and 1.5 mM each of
dATP, dCTP, dGTP and dTTP. The sample was overlaid with 60 µl
of light mineral oil, the target was denatured at 93°C for 5 min,
and the sample was
subjected to 40 cycles consisting of 20 sec at 93°C, 40 sec at
55°C, and 1 min at 72°C. Following temperature cycling, the
sample was subjected to 10 min at 72°C to complete extension of
the DNA sample.
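
As an illustrative sketch only, the temperature program just described can be written down as data and its total programmed hold time computed in Python. The class and constant names are hypothetical, and ramp times between temperatures, which depend on the instrument and are not given in the text, are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One hold in the temperature program (temperature in °C, duration in seconds)."""
    temp_c: float
    seconds: int


# Values taken from the amplification protocol in the text.
INITIAL_DENATURATION = Step(93, 5 * 60)
CYCLE = (Step(93, 20), Step(55, 40), Step(72, 60))
CYCLES = 40
FINAL_EXTENSION = Step(72, 10 * 60)


def total_hold_seconds():
    """Sum of all programmed holds; ramping between temperatures is not included."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return (
        INITIAL_DENATURATION.seconds
        + CYCLES * per_cycle
        + FINAL_EXTENSION.seconds
    )


if __name__ == "__main__":
    seconds = total_hold_seconds()
    print(f"{seconds} s ({seconds / 60:.0f} min) of programmed holds")
```

Each cycle holds for 120 seconds, so the 40 cycles account for 4800 seconds; with the 5 min initial denaturation and the 10 min final extension, the programmed holds total 5700 seconds (95 minutes).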

Oligonucleotides are synthesized and modified for
particular functions in the ligation assay. The assay requires a
minimum of two modified oligonucleotides. One oligonucleotide
has a hook that permits capture of the oligonucleotide following
ligation. An example of this is a biotinylated oligonucleotide
which can be captured on streptavidin or avidin supports. The
other oligonucleotide has a reporter group. In the case of a
fluorophore reporter, multiple reporters with different emission
spectra could easily be incorporated into a single assay.

For an ELISA based system, probes which discriminate
allelic forms of a gene are synthesized with a 5' biotin group.
Reporter probes are enzymatically or chemically
5'-phosphorylated and labelled with the hapten digoxigenin. The
hapten is added to the 3' end of the reporter probe by tailing 500
pM of oligonucleotide at 37°C for 1 hour in 10 mM potassium
cacodylate, pH 7.0, 1 mM CoCl2, 0.1 mM DTT, 5 nM of digoxigenin
dUTP, 0.05 µM of dATP, and 100 units of the enzyme terminal
transferase in a total volume of 20 µl. After labelling, 2 µl of 3
M sodium acetate, 1 µl of yeast t-RNA (1 mg/ml) and 60 µl of
95% ethanol are added. The oligonucleotide is precipitated at 4°C
for 5 min and then collected by centrifugation at 6500 x g for 5
minutes. The pellet is resuspended in 20 µl of distilled water and
the process repeated. This precipitation removes unconjugated
excess digoxigenin from the labelled probe. Examples of
oligonucleotides which discriminate alleles for three pathologic
states are given in the following Table VII:

TABLE VII

(sequences of example oligonucleotides for ELISA detection)

Target Gene          Form      Biotinylated Primer         Labelled (L) Primer

β-globin             β^A       B1-ATGGTGCACCTGACTCCTGA
                     β^S       B2-ATGGTGCACCTGACTCCTGT

alpha1-antitrypsin   M         B1-GGCTGTGCTGACCATCGACG     PGWB&eKMttGCV
                     Z         B2-GGCTGTGCTGACCATCGACA

cystic fibrosis      non-508   B1-ATTAAAGAAAATATCATCTT     TGGTGTTTCCTATGATGAAT


Utilizing the procedure contained in the previous flow chart,
a number of experiments were run and, after color development,
data were obtained spectrophotometrically at a wavelength of 490 nm.
Typical results for such tests have been tabulated in the
following Table VIII:

TABLE VIII

(spectrophotometric data from automated ligation
reactions using Taq ligase)

                                Ligation Primer Mix
Amplified Genomic
DNA Target From            B1 + L             B2 + L

β-globin
    β^A                    1.27 ± 0.06        0.01 ± 0.01
    β^S                    0.04 ± 0.03        1.85 ± 0.03

alpha1-antitrypsin
    M                      1.85 ± 0.15        0.03 ± 0.01
    Z                      0.03 ± 0.03        1.47 ± 0.07

cystic fibrosis
    non-508                1.33 ± 0.20        0.02 ± 0.01
    508                    0.01 ± 0.01        1.66 ± 0.16
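
Purely as an illustration of how readings of this kind might be interpreted, the following Python sketch classifies a pair of absorbance values for the two ligation primer mixes. The function name call_signal and the cutoff of 0.5 absorbance units are hypothetical choices made for this sketch; the text does not specify a decision threshold.

```python
# Mean absorbance readings from Table VIII as (B1 + L, B2 + L) pairs.
TABLE_VIII = {
    ("beta-globin", "beta-A"): (1.27, 0.01),
    ("beta-globin", "beta-S"): (0.04, 1.85),
    ("alpha1-antitrypsin", "M"): (1.85, 0.03),
    ("alpha1-antitrypsin", "Z"): (0.03, 1.47),
    ("cystic fibrosis", "non-508"): (1.33, 0.02),
    ("cystic fibrosis", "508"): (0.01, 1.66),
}


def call_signal(b1_reading, b2_reading, cutoff=0.5):
    """Classify a well pair by which primer mix gave a signal at or above cutoff.

    The cutoff is an assumed value for illustration, not one from the text.
    """
    b1_positive = b1_reading >= cutoff
    b2_positive = b2_reading >= cutoff
    if b1_positive and b2_positive:
        return "both"
    if b1_positive:
        return "B1 only"
    if b2_positive:
        return "B2 only"
    return "neither"


if __name__ == "__main__":
    for (gene, form), (b1, b2) in TABLE_VIII.items():
        print(f"{gene:20s} {form:8s} {call_signal(b1, b2)}")
```

With the assumed cutoff, every target in the table gives a signal with exactly one primer mix: the β^A, M and non-508 targets with the B1 mix, and the β^S, Z and 508 targets with the B2 mix.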

Comparable levels of detection were achieved with either
T4 or Taq ligase. In addition, a number of ligation reactions have
been performed for several other disease associated
polymorphisms with comparable results. Additionally, eight
different polymorphisms in the human T cell receptor loci have
been examined with similar detection results. The present
invention, therefore, appears to be generally applicable in the
analysis of DNA polymorphisms consisting of single base
substitutions, DNA deletions or insertions, or DNA translocations.

In addition, a number of alkaline phosphatase substrates can
be employed in the ELISA assay of the present invention, including
sensitive chemiluminescent substrates (10 attomole detection).
The format of the assay is easily adapted to other reporter
formats such as fluorophores which can be read in the appropriate
microtiter format. Incorporation of the appropriate fluorophore
format would, for example, permit multiplex analysis by ligation.
In this scheme, oligonucleotides discriminating different alleles
and/or different genes could be evaluated in a single assay.
Furthermore, it is also possible that tandem ligation assays
(ligation of oligonucleotides in chains) could be employed to
assess closely spaced DNA polymorphisms such as those which
exist in the major histocompatibility complex genes. Such
modifications to the assay specifically depicted above are
considered to be well within the scope of the present invention.

The present invention can be used in a wide variety of DNA
diagnostic screens. For example, and not intending to limit the
scope of the present invention, such DNA diagnostic screens may
include those according to the following summary:
A - INFECTIOUS DISEASES:

1. Viral Diseases: HIV, EBV, HPV, HSV, CMV, Hepatitis
   (non-A, non-B)
      (i) blood and tissue screening
      (ii) rapid identification
      (iii) distinguish chronic infection from past exposure
      (iv) distinguish resistant strains in mixed infection

2. Bacterial Diseases: Mycobacteria, Syphilis, Chlamydia,
   Legionella, Campylobacter, Pneumocystis, Listeria,
   Lyme, Leprosy
      (i) rapid identification of slow growing microbes
      (ii) identification in immuno-deficient patients
      (iii) testing food for contamination

3. Parasitic Diseases: Malaria, Trypanosomes, Leishmania
      (i) rapid identification of "third world" blood
          diseases
      (ii) screening travelers and armed forces

B - GENETIC DISEASES:

1. Single Allele Diseases: Cystic Fibrosis, Duchenne's
   muscular dystrophy, Sickle Cell Anemia, β-thalassemia,
   Haemophilia A, Gaucher, Tay-Sachs, Alzheimer's,
   Neurofibromatosis

2. Cancer: Retinoblastoma, Wilms tumor, Colon, Breast,
   Oncogenes, Tumor suppressors

3. Multiple Allele Diseases: Coronary heart disease,
   Diabetes, High blood pressure, Schizophrenia, Manic-
   depression, Alcohol abuse
      (i) predisposition to disease
      (ii) preventive medicine, exercise, diet
      (iii) genetic screening and counseling
      (iv) gene therapy

C - GENETIC IDENTIFICATION:

1. Humans: HLA typing, forensics
      (i) tissue transplantation
      (ii) genetic linkage analysis
      (iii) human genome program
      (iv) positive identification of missing children

2. Animals: Horses, Dairy cows, Cattle, Domestic pets
      (i) pure genetic characteristics
      (ii) confirm breeding lineage
      (iii) positive identification of animals

3. Plants: Seed Stock
      (i) assure genetic diversity

      (ii) identify strains resistant to drought and disease

Thus, while we have illustrated and described the preferred
embodiment of our invention, it is to be understood that this
invention is capable of variation and modification, and we
therefore do not wish to be limited to the precise terms set
forth, but desire to avail ourselves of such changes and
alterations which may be made for adapting the invention to
various usages and conditions. Accordingly, such changes and
alterations are properly intended to be within the full range of
equivalents, and therefore within the purview of the following
claims.

Having thus described our invention and the manner and
process of making and using it in such full, clear, concise and
exact terms so as to enable any person skilled in the art to which
it pertains, or with which it is most nearly connected, to make
and use the same;

WE CLAIM:

1. The cell line AK76 designated as ATCC 55032.

2. A plasmid selected from pDZ1 and designated as ATCC 
68307, and pDZ7 and designated as ATCC 68308. 

3. A purified isolated DNA fragment or nucleic acid 
sequence that hybridizes to such sequence under high stringency 
consisting essentially of a DNA sequence coding for a 
thermostable Thermus aquaticus strain HB8 ligase enzyme.

4. A purified isolated DNA fragment or nucleic acid 
sequence that hybridizes to such fragment under high stringency 
which comprises the partial sequence coding for a thermostable 
ligase having the nucleic acid sequence 

ATG ACC CTG GAA GAG GCG AGG AAG CGG GTA AAC GAG TTA CGG GAC 
CTC ATC CGC TAC CAC AAC TAC CGC TAG TAG GTC CTG GOG GAC CCG 
GAG ATC TCC GAC GCC GAG TAG GAG CGG CTT CTT AGG GAG CTC AAG 
GAG CTT GAG GAG CGC TTC CCC GAG CTC AAA AGC COG GAC TOC GGC 
ACC CTT CAG GTG GGG GCG AGG OCT TTG GAG GCC ACC TTC CGC GCC 
GTC(D3C(^CCCCACCCGCATGTACT0CTTC 

CTT GAC GAG CTC AAG GCC TTT GAG GAG CGG ATA GAA OGG GCC CTG 
GGG CGG AAG GGC 000 TTC GCC TAC ACC GTG GAG CAC AAG GTG GAC 
GGG CTT TCC GTG AAC CTC TAC TAC GAG GAG GGG GTC CTG GTC TAC 
GGG GCC ACC GCC GGG GAC GGG GAG GTG GGG GAG GAG GTC ACC CAG 
AAC CTC CTC ACC ATC CCC ACC ATC CCG AGG AGG CTC AAG GGG GTG 
CCG GAG CGC CTC GAG GTC CGG GGG GAG GTC TAC ATG CCC ATA GAG 
GCC TTC CTC CGG CTC AAC GAG GAG CTG GAG GAG CGG GGG GAG AGG 
ATC TTC AAA AAC OCT AGG AAT GCG GCG GCG GGT TCC TTA AGG CAA 
AAA GAC CCC CGC ATC ACC GCC AAG OGG GGC CTC AGG GGC ACC TTC 
TAC GCC TTA GGG CTT GGG CTG GAG GAG GTG GAG AGG GAA GGG GTC 
GCG ACC CAG TTT GCC CTC CTC CAC TGG CTC AAG GAA AAA GGC TTC
COC GTG GAG CAC GGC TAC GCC CGG GCC GTG GQG GCG GAA GGG GTG
GAG GOG GTC TAG CAG GAC TGG CTC AAG AAG CGG CGG QCG CTT COC 
TTT GAG GCG GAC GQG GTG GTG GTG AAG CTG GAC GAG CTT GCC CTT 
TGG CGG GAG CTC GGC TAC ACC GOC CGC GCC COC OGG TTC GOC ATC 
GCC TAC AAG TTC OCC GCC GAG GAG AAG GAG AOC COG CTT TTG GAC 
GTG GTC TTC CAG GTG GGG CGC ACC GGG CGG GTG ACC COC GTG GGG 
ATC CTC GAG COC GTC TTC CTA GAG GGC AGO GAG GTC TOO OGG GTC 
AOC CTG CAC AAC GAG AGC TAC ATA GAG GAG TTG GAC ATC CGC ATC 
GQG GAC TGG GTT TTG GTG CAC AAG GCG GGC GGG GTC ATC COC GAG 
GTC CTC CGG GTC CTC AAG GAG AGG CGC ACQ GGG GAG GAA AGG COC 
ATT CGC TGG COC GAG ACC TGC CGC GAG TGC GGC CAC CGC CTC CTC 
AAG GAG GGG AAG GTCCACCGCTGCCCCAACCOCTTGTGOCCCGC^ 
AAG CGC TTT GAG GCC ATC CGC CAC TTC GCC TCC CG^ AAG GCC ATG 
GAC ATC CAG GOC CTG GGG GAA AAG CTC ATT GAG AGG CTT TTG GAA 
AAG GGG CTG GTC AAG GAC GTG GCC GAC CTC TAC CGC TTG AGA AAG 
GAA GAC CTG GTG GGC CTG GAG CGC ATG GGG GAG AAG AGC GCC CAA 
AAC CTC CTC CGC GAG ATA GAG GAG AGC AAG AAA AGA GGC CTG GAG 
CGC CTC CTC TAC GCC TTG GGG CTT 000 GGG GTG GQG GAG GTC TTG 
GCC OGG AAC CTG GCG GCC CGC TTC GGG AAC ATG GAC CGC CTC CTC 
GAG GCC AGC CTG GAG GAG CTC CTG GAG GTG GAG GAG GTG GGG GAG 
CTC ACQ GCG AGG GCC ATC CTG GAG ACC TTG AAG GAC COC GCC TTC 
CGC GAC CTG GTA CGG AGG CTC AAG GAG GCG GGG GTG GAG ATG GAG 
GCC AAG GAG AAG GGC GGG GAG GCC CTT AAA GGG CTC ACC TOO GTG 
ATC AOC GGG GAG CTT TCC CGC COC CGG GAA GAG GTG AAG GCC CTC 
CTA AGG CGC CTC GGG GCC AAG GTG AGG GAC TCC GTG AGC CGG AAG 
ACG AGC TAC CTC GTG GTG GGG GAG AAC COG GGG GAG AAC COG GGG 
AGC AAG CTG GAG AAG GCC AGG GOO CTC GGG GTC COC ACC CTC ACG 
GAG GAG GAG CTC TAC CGG CTC CTG GAG GOG CGG ACG GGG AAG AAG 
GCG GAG GAG CTC GTC TAA AGGCTTCC. 

5. An expression vector comprising a DNA sequence
encoding for a thermostable ligase selected from the group
consisting of (1) Thermus aquaticus HB8 ligase; (2) a
thermostable ligase having at least 6 sequential amino acid
residues corresponding to 6 sequential amino acid residues in
Thermus aquaticus HB8 ligase; and (3) a ligase active mutant of
Thermus aquaticus HB8 ligase or a fragment thereof wherein an
amino acid residue has been inserted, substituted or deleted in or
from the amino acid sequence of the ligase or its fragment.

6. A purified isolated polypeptide which catalyses the 
formation of a phosphodiester bond at the site of a single- 
stranded break in duplex DNA at temperatures of about 50°C to 
about 85°C, and which does not become irreversibly denatured and 
lose its catalytic ability when subjected to temperatures of from 
90°C to about 105°C.

7. A purified isolated polypeptide having the amino acid 
sequence: 



Met Thr Leu Glu Glu Ala Arg Lys Arg Val Asn Glu Leu Arg Asp
Leu Ile Arg Tyr His Asn Tyr Arg Tyr Tyr Val Leu Ala Asp Pro
Glu Ile Ser Asp Ala Glu Tyr Asp Arg Leu Leu Arg Glu Leu Lys
Glu Leu Glu Glu Arg Phe Pro Glu Leu Lys Ser Pro Asp Ser Pro
Thr Leu Gln Val Gly Ala Arg Pro Leu Glu Ala Thr Phe Arg Pro
Val Arg His Pro Thr Arg Met Tyr Ser Leu Asp Asn Ala Phe Asn
Leu Asp Glu Leu Lys Ala Phe Glu Glu Arg Ile Glu Arg Ala Leu
Gly Arg Lys Gly Pro Phe Ala Tyr Thr Val Glu His Lys Val Asp
Gly Leu Ser Val Asn Leu Tyr Tyr Glu Glu Gly Val Leu Val Tyr
Gly Ala Thr Arg Gly Glu Gly Glu Val Gly Glu Glu Val Thr Gln
Asn Leu Leu Thr Ile Pro Thr Ile Pro Arg Arg Leu Lys Gly Val
Pro Glu Arg Leu Glu Val Arg Gly Glu Val Tyr Met Pro Ile Glu
Ala Phe Leu Arg Leu Asn Glu Glu Leu Glu Glu Arg Gly Glu Arg
Ile Phe Lys Asn Pro Arg Asn Ala Ala Ala Gly Ser Leu Arg Gln
Lys Asp Pro Arg Ile Thr Ala Lys Arg Gly Leu Arg Ala Thr Phe
Tyr Ala Leu Gly Leu Gly Leu Glu Glu Val Glu Arg Glu Gly Val
Ala Thr Gln Phe Ala Leu Leu His Trp Leu Lys Glu Lys Gly Phe
Pro Val Glu His Gly Tyr Ala Arg Ala Val Gly Ala Glu Gly Val
Glu Ala Val Tyr Gln Asp Trp Leu Lys Lys Arg Arg Ala Leu Pro
Phe Glu Ala Asp Gly Val Val Val Lys Leu Asp Glu Leu Ala Leu
Trp Arg Glu Leu Gly Tyr Thr Ala Arg Ala Pro Arg Phe Ala Ile
Ala Tyr Lys Phe Pro Ala Glu Glu Lys Glu Thr Arg Leu Leu Asp
Val Val Phe Gln Val Gly Arg Thr Gly Arg Val Thr Pro Val Gly
Ile Leu Glu Pro Val Phe Leu Glu Gly Ser Glu Val Ser Arg Val
Thr Leu His Asn Glu Ser Tyr Ile Glu Glu Leu Asp Ile Arg Ile
Gly Asp Trp Val Leu Val His Lys Ala Gly Gly Val Ile Pro Glu
Val Leu Arg Val Leu Lys Glu Arg Arg Thr Gly Glu Glu Arg Pro
Ile Arg Trp Pro Glu Thr Cys Pro Glu Cys Gly His Arg Leu Leu
Lys Glu Gly Lys Val His Arg Cys Pro Asn Pro Leu Cys Pro Ala
Lys Arg Phe Glu Ala Ile Arg His Phe Ala Ser Arg Lys Ala Met
Asp Ile Gln Gly Leu Gly Glu Lys Leu Ile Glu Arg Leu Leu Glu
Lys Gly Leu Val Lys Asp Val Ala Asp Leu Tyr Arg Leu Arg Lys
Glu Asp Leu Val Gly Leu Glu Arg Met Gly Glu Lys Ser Ala Gln
Asn Leu Leu Arg Gln Ile Glu Glu Ser Lys Lys Arg Gly Leu Glu
Arg Leu Leu Tyr Ala Leu Gly Leu Pro Gly Val Gly Glu Val Leu
Ala Arg Asn Leu Ala Ala Arg Phe Gly Asn Met Asp Arg Leu Leu
Glu Ala Ser Leu Glu Glu Leu Leu Glu Val Glu Glu Val Gly Glu
Leu Thr Ala Arg Ala Ile Leu Glu Thr Leu Lys Asp Pro Ala Phe
Arg Asp Leu Val Arg Arg Leu Lys Glu Ala Gly Val Glu Met Glu
Ala Lys Glu Lys Gly Gly Glu Ala Leu Lys Gly Leu Thr Phe Val
Ile Thr Gly Glu Leu Ser Arg Pro Arg Glu Glu Val Lys Ala Leu
Leu Arg Arg Leu Gly Ala Lys Val Thr Asp Ser Val Ser Arg Lys
Thr Ser Tyr Leu Val Val Gly Glu Asn Pro Gly Ser Lys Leu Glu
Lys Ala Arg Ala Leu Gly Val Pro Thr Leu Thr Glu Glu Glu Leu
Tyr Arg Leu Leu Glu Ala Arg Thr Gly Lys Lys Ala Glu Glu Leu
Val.

8. A purified polypeptide isolated from a recombinant 
organism transformed with a vector that codes for the expression 
of Thermus aquaticus ligase which polypeptide catalyses the 
formation of a phosphodiester bond at the site of a single-
stranded break in duplex DNA at a temperature of about 50°C to
about 85°C, and which polypeptide does not become irreversibly 
denatured and lose its catalytic ability when subjected to 
temperatures in the range of from about 90°C to about 105°C. 

9. A purified isolated ligase which catalyses the 
formation of a phosphodiester bond between two adjacent 
oligonucleotides hybridized to a complementary strand of DNA at 
a temperature in the range of about 50°C to about 85°C. 

10. A purified isolated ligase which catalyses ligation of 
two adjacent oligonucleotides hybridized to a complementary 
target sequence of DNA at a temperature in the range of about 
50°C to about 85°C, wherein the products formed of the ligation 
are about 50 to about 500 more than if a single base mismatch is 
present at the junction of the adjacent oligonucleotides. 

11. A purified isolated ligase which retains the ability to 
catalyze formation of a phosphodiester bond between two 
adjacent oligonucleotides hybridized to a complementary strand 
of DNA at a temperature of about 50°C to about 85°C after 
repeated prior exposure to temperatures of from about 90°C to 
about 105°C for about 0.25 min to about 4 minutes.

12. A purified isolated ligase which retains the ability to 
catalyze formation of a phosphodiester bond between two 
adjacent oligonucleotides hybridized to a complementary strand 
of DNA at a temperature range of about 50°C to about 85°C and 
wherein the products formed of the ligation are about 50 to about 
500 more than if a single base mismatch is present at the
junction of the adjacent oligonucleotides, after repeated prior
exposures to a temperature range of from about 90°C to about 
105°C for about 0.25 min to about 4 minutes. 

13. A method for amplifying a nucleic acid test substance 
of known nucleotide sequence comprising: 

(1) providing a reaction mixture comprising a first 
set of two adjacent oligonucleotides complementary to and 
in molar excess of a target sequence nucleic acid and 
further having no mismatch to the target sequence DNA at 
the junction of the adjacent oligonucleotides; 

(2) providing a thermostable ligase which does not 
become irreversibly denatured and lose its catalytic ability 
when subjected to temperatures of from about 50°C to 
about 105°C; and 

(3) subjecting the ligase-mixture to at least two 
temperature cycles which comprises a first temperature 
range of about 90°C to 105°C and a second temperature 
range of about 50°C to about 85°C. 

14. A method for detecting a biologically derived nucleic 
acid test substance having a known normal nucleotide sequence 
and a known possible mutation at at least one target nucleotide 
position in the sequence which comprises: 

(1) providing a reaction mixture comprising two
adjacent oligonucleotides complementary to the target
sequence nucleic acid and wherein the oligonucleotides have
at least one mismatched base pair to the mutant target
sequence nucleic acid but not to the normal nucleotide
sequence nucleic acid at the junction of the adjacent
nucleotides;

(2) providing a second reaction mixture comprising 
two adjacent oligonucleotides complementary to the target 
sequence nucleic acid and having at least one mismatched 
base pair to the normal target sequence nucleic acid, but no 
mismatch to the mutant target sequence nucleic acid at the 
junction of the adjacent oligonucleotides; 

(3) providing a thermostable ligase which does not 
become irreversibly denatured and lose catalytic activity 
when subjected to temperatures of from 50°C to about 
105°C to each of the first and second reaction mixtures; 

(4) subjecting each of the ligase-mixtures to at least
one temperature cycle which comprises a first temperature
of from about 90°C to about 105°C and a second temperature
of about 50°C to about 85°C;

(5) allowing the adjacent oligonucleotides in each 
reaction mixture to become possibly covalently linked; 

(6) separating the test substance and unlinked 
oligonucleotides in each reaction mixture from possible 
covalently linked oligonucleotides; and 

(7) detecting the presence or absence of covalently 
linked oligonucleotide product in each reaction mixture 

wherein the presence of covalently linked oligonucleotides in the
first reaction mixture indicates the presence of normal sequence
and the presence of covalently linked oligonucleotides in the
second mixture indicates the presence of mutant sequences.

15. A kit for assaying a biologically derived DNA or RNA 
test substance which has a known normal nucleotide sequence and 
a known possible mutation at at least one target nucleotide 
position in the sequence, the kit comprising: 

(1) a container holding a first reaction mixture 
comprising two adjacent oligonucleotides complementary to 
the target sequence nucleic acid, wherein one 
oligonucleotide is labelled and there is at least one 
mismatched base pair to the mutant target sequence 
nucleic acid, but not to the normal target sequence nucleic 
acid at the junction of the adjacent oligonucleotides; 

(2) a container holding a second reaction mixture 
comprising two adjacent oligonucleotides complementary to 
the target sequence nucleic acid, wherein one 
oligonucleotide is labelled, wherein there is at least one 
mismatched base pair to the normal target sequence nucleic 
acid, but not to the mutant target sequence nucleic acid at 
the junction of the adjacent oligonucleotides; and 

(3) a thermostable ligase which does not become 
irreversibly denatured and lose its catalytic ability when 
subjected to temperatures of from 50°C to about 105°C.